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Description 

FIELD OF THE INVENTION 

2e, £ S™nSr ,eS ,0 ' P ° VnUde0,ide enC ° din9 8 ^ Pr ° tein ' 3 Pr0tei " e ™ ded bv *• 
BACKGROUND OF THF INVENTION 



£S ? "T* sequencin 9 P r °j ec,s ' the determination and analysis of the genomic DNA of various livino or 
gamsms have been m progress all over the world. The whole genomic sequences of more than 10 sp^iTs of proZ 

whTch il I!' VeaSt ' 3 mUltiCe " Ular eUkary0 ' e ' C - eleganS are alreadv **™-ed. As oZan genome" 

btn , ' 1 P ? . 6 COmP ° Sed °' ,hr6e th ° USand mi " i0n base P aire ' the wortd wide cooperative projects Save 
* oiZlZ V , ^T* Wh °' e S,rUC,Ure iS predic,ed t0 be dete ™<* ^ the yeaS aoof^e aim 

SI olnT 9en °T iC SeqUenCe " t0 feVeal ,he ,UnCti0ns of a » 9 enes and their regulation and to unde^tand 

S Z IZ^T? between 9 enes . P"**.. cells or individuals through deducing So" 

bUSZ .hf „ ' ? ,S 8 Uepnm °' ' he high ' y com P |icated ^ing organisms. To understand living Lan isms 
by u tang he genom.c information from various species is not only important as an academic subject birt also 
significant from the viewpoint of industrial application. acaaemic suDject, but also socially 

*> [0003] However, determination of genomic sequences itself cannnot identify the functions of all oenes For Pyflmnlo 

s^l^w^ haK ° f ^ 6000 ib * p2SS2SS th°r g X e a no P m c 

hZZ?!h ! J. ? 6d ^ f ° r human ' the number of ,he 9 enes « predicted to be approximately one 
S Th f re ' " fe dGSirable t0 68,315,18(1 " a hi 9 h ,hrou 9"P ut analysis system of the glTfuncLns" 
- sequent ^ " - ,UnC,i ° nS ° f ~ ^ « 9eneS ° btained ^he genoml 

[0004] Many genes in the eukaryotic genome are split by introns into multiple exons Thus it is difficult to nrpriirt 

cDNA? h" 3 h T J ° f ^ and in, ° rmati0n around the 5 '" end of mRNA ^ extreme^ 
* Soo anHir 1 Th r^ 0 "*" 9 m ^ As "nose encoding protein sequences are deduced is approximlTety 
7000, and further, the number of full-length therein is oniy 5500. Thus, even including cDNA registered aT^T the 
percentage of human cDNA obtained so far is estimated to be 10-15% of all the gene? ' 
[0006] It .s possible to identify the transcription start site of mRNA on the genomic seouence based on th* v d „h 
sequence of a full-length cDNA, and to analyze factors involved in the b^V^^^ ^ZLZ he cDNA 

» :::: r gu ati ° n of expression at ,he since . 

>o start s.te, in the 5'-reg,on, it can be translated into a protein in a correct frame Therefore it fe oosstole to 

anaooToril ^ Pr ° ,ein * "» ^ °' to ana * 2e b <°^ ac,ivi * - ^^SP£S^ 

ZTZT n TT" SyS,6m ThUS ' ana ' ySiS ° f 3 fU,Men9th cDNA P rovid * > valuableLrmatL Sch comple 
Zl e ?„ T T T 9en ° me •«««**«■ A,s °. '""-length cDNA clones that can be expressed are « 
valuable in emp.ncal analysis of gene function and in industrial application xpressec are extremely 

, PartiCUlar ' HUman SeCret0rV Pr ° ,einS or membrane proteins are would be useful by itself as a medicine like 
tesue plasminogen actuator (TPA), or as a target of medicines like membrane receptors. I additio n qenest *S 

dtera~ K r 38 3 med ' C,ne ' 3nd alSO tne clones Potentially include a gene associated with 
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SUMMARY OF THE INVENTION 

[0009] An objective of the present invention is to provide a primer that enables synthesizing polynucleotide from 
human, the resulting polynucleotide or its clone, and a protein encoded by the polynucleotide. 

5 [0010] The inventors have developed a method for efficiently cloning a human full-length cDNA that is predicted by 
the ATGpr etc. to be a full-length cDNA clone, from a full-length-enriched cDNA library that is synthesized by the oligo- 
capping method. Then, the inventors determined the nucleotide sequence of the obtained cDNA clones from both 5'- 
and 3'- ends. By utilizing the sequences, the inventors selected clones that were expected to contain a signal by the 
PSORT(Nakai K. and Kanehisa M. (1992) Genomics 14: 897-911), and obtained clones that contain a cDNA encoding 

10 a secretory protein or membrane protein. Moreover the inventors specifically selected full-length cDNA clones that 
encode secretory or membrane proteins, signal transduction-associated proteins, glycoprotein-associated proteins, 
transcription-associated proteins, or disease-associated proteins from clones homologous to the clones in the Swiss- 
Prot (http://www.ebi.ac.uk/ebi_docsSwissProt_db/swisshome.html) according to the keywords of SwissProt. 
[0011] The full-length cDNA clones of the present invention have high fullness ratio since these were obtained by 

15 the combination of (1) construction of a full-length-enriched cDNA library that is synthesized by the oligo-capping 
method, and (2) a system in which fullness ratio is evaluated from the nucleotide sequence of the 5'-end (in this system, 
clones are selected based on the estimation by the ATGpr, following the removal of sequences judged not to be full- 
length when compared with ESTs). However, the primers of the present invention enable obtaining full-length cDNA 
easily without any special methods mentioned above. 

20 [0012] Homology analysis in which the analysis is carried out against a non-full-length cDNA fragment to postulate 
the function of a protein encoded by said fragment, is being commonly performed. However, since such analysis is 
based on the information of the fragment, it is not clear as to whether this fragment corresponds to a part that is 
functionally important in the protein. In other words, the reliability of the homology analysis based on the information 
of a fragment is doubtful, as information relating to the structure of the whole protein is not available. However, the 

25 homology analysis of the present invention is conducted based on the information of a full-length cDNA comprising 
the whole coding region of the cDNA, and therefore ; the homology of various portions of the protein can be analyzed. 
Hence, the reliability of the homology analysis has been dramatically improved in the present invention. 
[0013] The inventors completed the invention by finding that it is possible to synthesize a novel full-length cDNA by 
using the combination of a primer that is designed based on the nucleotide sequence of the 5' -ends of the selected 

30 full-length cDNA clones and any of an oligo-dT primer or a 3'-primer that is designed based on the nucleotide sequence 
of the 3'-ends of the selected clones. 

[0014] Thus, the present invention relates to primers described below, a method for synthesizing a polynucleotide 
using the primers, and polynucleotides obtained by the method. 
[0015] First, the present invention relates to 

35 

(1) use of an oligonucleotide as a primer for synthesizing the polynucleotide comprising the nucleotide sequence 
set forth in any one of SEQ ID NOs: 1 -829 and 2545, or the complementary strand thereof, wherein said oligonu- 
cleotide is complementary to said polynucleotide or the complementary strand thereof and comprises at least 15 
nucleotides; 

40 (2) a primer set for synthesizing polynucleotides, the primer set comprising an oligo-dT primer and an oligonucle- 

otide complementary to the complementary strand of the polynucleotide comprising the nucleotide sequence set 
forth in any one of SEQ ID NOs: 1-829 and 2545, wherein said oligonucleotide comprises at least 15 nucleotides; 
and 

(3) A primer set for synthesizing polynucleotides, the primer set comprising a combination of an oligonucleotide 
45 comprising a nucleotide sequence complementary to the complementary strand of the polynucleotide comprising 

a 5'-end nucleotide sequence and an oligonucleotide comprising a nucleotide sequence complementary to the 
polynucleotide comprising a 3'-end nucleotide sequence, wherein said oligonucleotides comprise at least 15 nu- 
cleotides and wherein said combination of 5'-end nucleotide sequence / 3'-end nucleotide sequence is selected 
from the combinations of 5'-end nucleotide sequence / 3'-end nucleotide sequence set forth in the SEQ ID NOs 
50 in Table 1 . 

[0016] Table 1 shows names of clones obtained in the examples described later, comprising the polynucleotide of 
the present invention (830 clones), names of nucleotide sequences at the 5'-end and 3'-end of the full-length cDNA, 
and their corresponding SEQ ID NOs. A blank indicates that the of the 3'-end sequence corresponding to the 5'-end 
55 sequence has not been determined the same clone. 

[0017] The SEQ ID NO of a 5'-end sequence is shown on the right side of the name of the 5'-end sequence, and the 
SEQ ID NO of a 3'-end sequence is shown on the right side of the name of the 3'-end sequence. 
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Table 1 



w 



15 



20 



25 



30 



35 



40 



45 



50 



55 



Correspondence 
Name of clone 

BNGH41 000020 
BNGH41 000087 
BNGH41 000091 
HEMBA1 000006 
HEMBA 1 000121 
HEMBA1000128 
HEMBA 1000275 
HEMBA1 000300 
HEMBA 1000349 
HEMBA1 000443 
HEMBA1 000462 
HEMBA1 000477 
HEMBA 1 000590 
HEMBA1 000634 
HEMBA1 000671 
HEMBA1000713 
HEMBA1 000732 
HEMBA1 000745 
HEMBA1 000835 
HEMBA1000875 
HEMBA1 000907 
HEMBA1 000940 
HEMBA1 000962 
HEMBA1001184 
HEMBA1001221 
HEMBA1001228 
HEMBA1001272 
HEMBA1001296 
HEMBA1001297 
HEMBA1001390 
HEMBA1001563 
HEMBA1001621 
HEMBA1001878 
HEMBA1001886 
HEMBA1 002048 
HEMBA1 002131 
HEMBA1002163 
HEMBA1002164 
HEMBA1002167 
HEMBA1002178 
HEMBA1002195 
HEMBA1 002227 
HEMBA1 002239 
HEMBA1002316 
HEMBA1 002420 
HEMBA 1002421 
HEMBA1 002524 



between names of clone and the sequence name, and the SEQ ID. 
Name of ^sequence \ SEQ ID [ 3 Name of 3'-sequenceTsiol5" 



F-BNGH4 1000020 
F-BNGH41 000087 
F-BNGH41 000091 
F-HEMBA1 000006 
F-HEMBA1 000121 
F-HEMBA1 000128 
F-HEMBA1 000275 
F-HEMBA1 000300 
F-HEMBA1 000349 
F-HEMBA1 000443 
F-HEMBA1 000462 
F-HEMBA1 000477 
F-HEMBA1 000590 
F-HEMBA1 000634 
F-HEMBA1 000671 
F-HEMBA1 000713 
F-HEMBA1 000732 
F-HEMBA1 000745 
F-HEMBA1 000835 
F-HEMBA1 000875 
F-HEMBA1 000907 
F-HEMBA1 000940 
F-HEMBA1 000962 
F-HEMBA1001184 
F-HEMBA1001221 
F-HEMBA1 001228 
F-HEMBA1001272 
F-HEMBA1 001296 
F-HEMBA1001297 
F-HEMBA1001390 
F-HEMBA1001563 
F-HEMBA1001621 
F-HEMBA1001878 
F-HEMBA1001886 
F-HEMBA 1002048 
F-HEMBA1 002131 
F-HEMBA1002163 
F-HEMBA1002164 
F-HEMBA1002167 
F-HEMBA1 002178 
F-HEMBA1002195 
F-HEMBA1 002227 
F-HEMBA1 002239 
F-HEMBA1002316 
F-HEMBA1 002420 
F-HEMBA1 002421 
F-HEMBA1 002524 



1 

2 






3 






4 


f ' riciviDMl UO000D 


830 


5 


i 1 > ft C 1 VI DM 1 UUU 1 c. 1 


831 


6 


1 R-HEMRAinnniQQ 


832 


7 


R-HEMBA1 000275 


833 


8 


R-HEMBA1 000300 


834 


q 


n-nnnnnnnnnnnn 


835 


l u 






11 


i R-H PMR A i nr\r\ a do 

| •» ntiviDA i 000462 


836 


12 


| n nciviDMl 0004/ / 


j 837 


13 


i R-H PMR A ^ f\f\r\cr\n 

i n-ntiviDA 1000590 


838 


14 


R-H P MR A -t r\nncn a 

I n-ntivl t>A 1000634 


\ 839 


15 


n-nciviDA I OOOo/ 1 


840 


16 


n-nciviDA 1 000/ 1 3 


841 


17 

I r 


R.MPMDAinon7n/t 

n-n tfviDAi 000732 


842 


18 

19 


n-nnnnnnnnnnnn 


843 


20 


R-H PM R A H AA("in7r 

n-ntivlDA 1 000875 


844 


21 i 
i 






22 


n-ncivlDA 1 000940 


845 


23 


n-ntriviDAI 000962 


j 846 


24 


n-nciviDMl 00 1 1 84 


847 


25 

C w I 


n-ntMoAi 001221 


848 


26 


R-MPMRA1 nn-t ooo 
n ncrviDA 1 001 


849 


27 


R-HEMBA1001272 


850 


28 


R-HEMBA1001296 


851 


29 


R-HEMBA1 001297 


852 


30 


R-HEMBA1001390 I 


853 


31 


R-HEMBA1001563 


854 


32 


R-HEMBA1001621 


855 


33 


R-HEMBA1001878 


856 


34 


R-HEMBA1001886 


857 


35 


R-HEMBA1 002048 


858 


36 


R-HEMBA1002131 


859 


37 


R-HEMBA1002163 


860 



38 

39 

40 

41 

42 

43 

44 

45 

46 

47 



R-HEMBA1002167 
R-HEMBA1002178 
R-HEMBA1002195 
R-HEMBA1 002227 

R-HEMBA1002316 
R-HEMBA1 002420 
R-HEMBA1 002421 
R-HEMBA1 002524 



861 
862 
863 
864 

865 
866 
867 
868 
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Table 1 (continued) 





Correspondence between names of clone and the sequence name, and the SEQ ID. 




Name of clone 


Name of 5' -sequence 


SEQ ID 


3 Name of 3' -sequence 


SEQ ID 


5 


HFMRA1 00PRR1 

F 1 L IVI U r\ 1 UU£.<JO 1 


F-HFMRA1 00PR51 

1 1 l L_ 1 VI UAA 1 VUCJJ 1 


4R 


R-HFMRA1 00PRR1 

III I L. IVI UrA 1 UUi- JU 1 


RR9 




HFMRA100P767 


F-HEMBA1 002767 


49 


R-HFMRA1 00P767 


870 




HFMBA1 002985 


F-HEMBA1 002985 

1 1 1 L IVI UrA 1 VVbyOtl 


R0 


R-HFMBA1 002985 

1 1 1 1 1 VI UrA 1 \J\J^ \7\s\J 


871 

O / I 




HEMBA1 002992 


F-HEMBA1 002992 


51 

wJ 1 






10 


HEMBA1 003047 


F-HEMBA1 003047 

1 \ 1 1_ IVI X— J fx 1 uuUu " / 


5P 


R-HEMBA1 003047 


87P 


HEMBA1003072 


F-HEMRA1 003072 


R3 


R-HFMBA1 003072 

11 1 1 1_ 1 V 1 UrA 1 \J Vu KJ 1 C 


873 




HFMBA1003101 

1 1 L I VI UrA 1 \J\J\J 1 \J 1 


F-HFMBA1 0031 01 

1 1 1 1 IVI UrA 1 VVd 1 \J 1 


R4 


R-HFMRA1003101 

n 1 1 LIVI U/A 1 uWO IVI 


874 




HFMBA1003120 

1 1 L_ IVI UrA 1 \J\J\J 1 C\J 


F-HEMBA1 0031 20 

1 1 II — IVI Un 1 UUU 1 C\J 


RR 


R-HFMRA1003120 

1 1 1 » L_ 1 VI UrA 1 \J\J\J i C-\J 


87R 




H EMBA1 003230 

1 II — 1 V 1 \—>l \ 1 UuULuU 


F-HEMBA1 003230 


56 


R-HEMBA1 003230 

1 1 1 1 1 — IVI U f \ 1 wVvlbuv 


876 


15 


HEMBA1 003294 


F-HFMBA1 003294 

1 1 1 L_ IV* vJw \ 1 UvUuv" 


57 


R-HEMBA1 003294 

111 \ U> 1 V 1 \ 1 WvOH C7" 


877 




HFMBA1003315 

i 1 1— ivi un i Uvjj i 


F-HFMRA1 003315 

1 1 1 1— IVI UAA 1 vuUU 1 *J 


58 


R-HFMBA1003315 

1 1 1 1 L. IVI UrA 1 WOO 1 O 


878 




HFMRA1 003392 

1 IL IVI UAA 1 UVJOOCrt. 


F-HFMRA1 003392 

I n CIVI DM 1 UUOU3/- 


R9 


R-HFMRA1 003392 

III 1 LIVI UAA 1 UUOOj;. 


R79 

Of <3 




HFMRA1 003399 

1 1 L IV) UAA 1 


F-HFMRA1 001399 
i n c ivi dm i \j<jooz?<j 


R0 
ou 


R-HFMRA1 00*a*a99 
n n cividm i uuooc7\7 


RRO 
oou 


on 


HFMRA1 003487 

n LIVI DAA I UwOtO/ 


F-HFIUIRA1 009487 

t n CIVIDM I UUOHO/ 


R1 

D 1 


R-HFMRA1 00^487 
n n cividm i uuoho/ 


RR1 

OO I 


H FMRA1 003497 

f 1 LIVI UAA 1 \J\J\J*T\J f 


F-HFMRA1 003497 

1 1 IL. IVI UAA 1 VVOtiJl 


RP 


R-HFMRA1 003497 

(I 1 1 LIVI UAA 1 


RRP 




HFMRA1 003R30 

I 1 L IVI UAA 1 UUOOOU 


F-HFMRA1 003R30 

I 1 1 L IVI UAA 1 UUUJUU 


63 
uo 


R-HFMRA1 003R30 

III 1 LIVI UAA 1 UUOuOU 


RR3 
ooo 




HFMRA1 003R02 

1 ILIVIDn 1 UUJUUt 


F-HFMRA1 003R02 

r 1 1 L IVI UAA 1 UUOUUzC 


64 


R-HFMRA1 003ROP 

III 1 LIVI UAA 1 UUOUuC 


884 




HFMRA 1003732 

1 1 CIVI DM 1 UUJ / \jc. 


F-HFMRA1 00973P 

ill CIVIDM 1 UUO / 0£* 


6R 


R-HFMRA1 00^73P 

n 11 L IVI DM 1 UUO / 


RRR 

OOO 


25 


HFMRA100394R 

II L IVI DAA I UUOi7*4iJ 


P-HFMRA1 00994R 

i n CIVIDM 1 UUOSHu 


6R 


R-HFMRA1 00^94R 
n n cividm i uuojjho 


RRR 
ooo 




H F M R A 1 004007 
n civi dm i uutyu / 


F-HFMRA1 004007 

I II CIVIDM I UU*tUUi 


R7 
O/ 


n n civi dm i ulihuu / 


RR7 
OO/ 




H F M R A 1 0040R7 

n C IVI Dn 1 UU4 uo / 


F-HFMRA1 0040R7 

I II CIVIDM 1 UUHUO/ 


RR 

DO 








HFMRA100408R 

1 1 C IVI DAA 1 UU*-tUO J 


F-HFMRA1 0040RS 

1 1 1 L IVI UAA 1 UUHUOU 


69 


R-HFMRA1 00408R 

n 1 IL IVI UAA 1 UUHUOJ 


888 

OOO 


30 


HFMRA1004110 

ntlVIDn IUUH 1 1 U 


F-HFMRA1 0041 1 0 

1 r IL IVI UAA 1 UUn 1 1 \J 


70 


R-nnnnnnnnnnnn 

Pi 1 1 1 1 1 1 1 II 1 1 1 i II 1 1 II 1 1 1 1 1 


RR9 


H F M R A 1 0042R0 

n C IVI DM 1 UUHtOU 


P.MPMRA 1 nn4PRH 
I n CIVIDM I \J\JH£.Z>\J 


/ l 


R.MFMRA1 nH4PRH 
nn cividm i uuh<iou 


RQfi 

OC7U 




MPMRA1 004^91 

II L IVI DM I UUHOS I 


F-MPMRA1 004^91 

i ll CIVIDM 1 U\JH\jZj 1 


7P 


R-MFMRA1 004*^91 
nn civi dm i UU4J y i 


R91 




H F M R A 1 004444 
n CI VI DM I UU't'Wt 


F-MFIUIRA1 004444 
r-n civiDM i uu*f*+*f*t 


7^ 


R-HFMRA1 004444 
n n civi dm i uu'W'rt 


RQP 




HFMRA1 0044R4 

n L IVI DAA I \J\JHH*J t * 


F-HFMRA1 0044^4 
nn cividm i uuftw 


74 


R-MFMRA1 0044R4 
nn civi dm i uuh^uh 


R93 


35 


H FMRA1 004R0R 

1 1 L IVI DAA 1 VJL/H-iJUiJ 


F-HFMRA1 004R0R 

l \ IL IVI UAA I UU4JUJ 


7R 


R-HFMRA1 004R0R 

fill L IVI UAA 1 UUHuUJ 


R94 




H FMRA1 0047BR 

1 1 L IVI UAA 1 Vu4 / OO 


F-HFMRA1 0047RR 

r n CIVIDM I UUHf 0>J 


76 


R-HFMRA1 0047RR 
n n cividm i \jyjHf dj 


R9R 
Oau 




H FMRA1 004797 

11 C IVI DM 1 UUH/ 57/ 


F-MFMRA1 004797 
i n C IVI DM 1 UUf/ 


77 


R-M FMR A 1 004797 
nn civi dm i uu*t/y/ 


RQR 




HFMRA1 0049R2 

n L IVI DM I UUfiJO/l 


F.HPMRA1 nn49R2 
r-n CIVIDM I \J\J**aD/l 


7R 


R-HFMRA1 0H4QRP 
nn l i vi dm i uuh5?o^ 


RQ7 
057 / 


40 


H FMRA1 004971 

n CIVIDM I UUHl? / f 


F-HFMRA1 004971 

i n CIVIDM I UU*+I7/ 1 


79 


R-H FMR A 1 004Q71 
n-n cividm i uuty/ i 


RQR 

0i70 


H FMRA1 0049R2 


F-HFMRA1 0049R2 

i n C IVI DM 1 UUtaut 


RO 


R-HFMRA1 0049RP 
n n cividm i uuttjo^ 


R99 




HFMRA100R070 

1 1 [ IVI UrA 1 UUJU / \J 


F-HFMRA1 00R070 

1 n L IVI DM 1 UUjU/ \j 


81 

O 1 


R-H FMRA1 00R070 
nn cividm i i \j 


900 

9UU 




HFMRA1 00R0R4 

1 1 L 1 V i UrA 1 UUJUO'i 


F-HFMRA1 00R0R4 

i n CIVIDM 1 UUjUO't 


RP 


R-HFMRA1 00R0R4 
nn cividm i uuuuot 


901 

3U 1 




HFMRA1 00R14R 
n L IVI DM l UUO I *+o 


P.HPMRA1 nnR14R 
i n CIVIDM 1 \J\JO i *+o 


R^ 
OO 


R MFMRA1 nn*^14^ 
n-n cividm i uuo i ho 


yut 


45 


H FMRA1 00R23O 

n CIVIDM I UUOtOU 


F-HFMRAI nnRP^H 
rnciviDM i uuotou 


R4 
OH 


R-HFMRA1 nnRP^n 
n-n cividm i uuocou 


i*UO 




HFMRA1 nnRP4R 

n L_ IVI DM 1 UUOcHD 


F-HFMRA1 00RP4R 
rn CIVIDM l UUO^HO 


RR 


R.MFMRA1 nnRP4R 
nn cividm i uuo^ho 


QH4 




HPMRA1 nnRPR7 
n CIVIDM I UUOcDr 


F.MFMRA1 OORPR7 
i n CIVIDM l UUOdur 


RR 
OD 


R.MFMRA1 nnRPR7 
n ncivi dm i uuocu/ 


QOR 




HEMBA1 005337 


F-HEMBA1 005337 


87 


R-HEMBA1 005337 


906 


50 


HEMBA1 005430 


F-HEMBA1 005430 


88 


R-HEMBA1 005430 


907 


HEMBA1 005449 


F-HEMBA1 005449 


89 


R-H EMBA1 005449 


908 




HEMBA1 005489 


FnEMBAI 005489 


90 


R-HEMBA1 005489 


909 




HEMBA1 005522 


F-HEMBA1 005522 


91 


R-H EMBA1 005522 


910 




HEMBA1 005545 


F-HEMBA1 005545 


92 


R-H EMBA1 005545 


911 


55 


HEMBA1 005698 


F-HEMBA1 005698 


93 


R-HEMBA1 005698 


912 




HEMBA1005913 


F-HEMBA1 005913 


94 


R-HEMBA1005913 


913 




HEMBA1 005929 


F-HEMBA1 005929 


95 


R-HEMBA1 005929 


914 
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Correspondence 
Name of clone 



Table 1 (continued) 



HEMBA1 005945 
HEMBA1006016 
HEMBA1006171 
HEMBA 1006276 
HEMBA1 006299 
HEMBA1006311 
HEMBA1 006335 
HEMBA1 006357 
HEMBA1 006430 
HEMBA1 006482 
HEMBA1006517 
HEMBA1 006544 
HEMBA1 006572 
HEMBA1006658 
HEMBA1 006707 
HEMBA1 006724 
HEMBA1 006749 
HEMBA1 006770 
HEMBA1 006902 
HEMBA1006912 
HEMBA1006916 
HEMBA1 006960 
HEMBA1007013 
HEMBA1 007057 
HEMBA1 007063 
HEMBA1 007226 
HEMBA1 007241 
HEMBA 1007291 
HEMBA1 007332 
HEMBB1000106 
HEMBB1 000276 
HEMBB1 000309 
HEMBB1 000407 
HEMBB1 000447 
HEMBB1 000542 
HEMBB1000567 
HEMBB1 000642 
HEMBB1000668 
HEMBB1 000679 
HEMBB1000881 
HEMBB1000905 
HEMBB1001026 
HEMBB1001048 
HEMBB1001200 
HEMBB1001407 
HEMBB1001530 
HEMBB1001547 
HEMBB1001573 



between names of clone and the sequence name, and the SEQ ID 



Name of 5'-sequence SEQ ID 



F-HEMBA1 005945 I 96 
F-HEMBA1006016 97 
F-HEMBA1006171 98 
F-HEMBA1 006276 99 
F-HEMBA1 006299 100 
F-HEMBA1 006311 101 
F-HEMBA1 006335 102 
F-HEMBA1 006357 103 
F-HEMBA1 006430 104 
F-HEMBA1 006482 105 
F-HEMBA1006517 106 
F-HEMBA1 006544 107 
F-HEMBA1 006572 108 
F-HEMBA1 006658 109 
F-HEMBA1 006707 110 
F-HEMBA1 006724 111 
F-HEMBA1 006749 112 
F-HEMBA1 006770 113 
F-HEMBA1 006902 114 
F-HEMBA1006912 115 
F-HEMBA1006916 116 
F-HEMBA1 006960 117 
F-HEMBA1007013 118 
F-HEMBA1 007057 119 
F-HEMBA1 007063 120 
F-HEMBA1 007226 121 
F-HEMBA1 007241 122 
F-HEMBA1 007291 123 
F-HEMBA1 007332 124 
F-HEMBB1000106 125 

F-HEMBB1 000276 126 

F-HEMBB1 000309 127 

F-HEMBB1 000407 128 

F-HEMBB1 000447 1 29 

F-HEMBB1 000542 130 

F-HEMBB1 000567 131 

F-HEMBB1 000642 132 

F-HEMBB1 000668 j 133 

F-HEMBB1 000679 134 

F-HEMBB1 000881 135 

F-HEMBB1 000905 136 

F-HEMBB1001026 137 

F-HEMBB1001048 138 

F-HEMBB1 001200 139 

F-HEMBB1001407 140 

F-HEMBB1 001530 141 

F-HEMBB1001547 142 

F-HEMBB1001573 j 143 



3 Name of 3'-sequence I SEQ ID 



H-HEMBA1 005945 j 915 
R-HEMBA1006016 916 
R-HEMBA1006171 917 
R-HEMBA1006276 918 
R-HEMBA1 006299 919 
R-HEMBA1 006311 | 920 
R-HEMBA1 006335 921 
R-HEMBA1 006357 922 
R-HEMBA1 006430 923 
R-HEMBA1 006482 924 
R-HEMBA1006517 925 
R-HEMBA1 006544 926 

R-HEMBA1 006572 927 

R-HEMBA1 006658 928 

R-HEMBA1 006707 929 

R-HEMBA1 006724 930 

R-HEMBA1 006749 931 

R-HEMBA1 006770 932 

R-HEMBA1 006902 933 

R-HEMBA1 006912 934 

R-HEMBA1006916 935 

R-HEMBA1 006960 936 

R-HEMBA1 007013 937 

R-HEMBA1 007057 938 
R-HEMBA1 007063 | 939 

R-HEMBA1 007241 I 940 

R-HEMBA1 007291 941 

R-HEMBA1 007332 942 
R-HEMBB1000106 j 943 

R-HEMBB1 000276 944 

R-HEMBB1 000309 945 

R-HEMBB1 000407 946 

R-HEMBB1 000447 947 

R-HEMBB1 000542 948 

R-HEMBB1 000567 949 

R-HEMBB1 000642 950 

R-H EMBB1 000668 951 

R-HEMBB1 000679 952 
R-HEMBB1000881 I 953 

R-H EMBB1 000905 954 

R-HEMBB1001026 955 

R-HEMBB1001048 956 

R-HEMBB1001200 957 

R-HEMBB1001407 958 

R-HEMBB1001530 959 

R-HEMBB1001547 960 
R-HEMBB1001573 j 961 
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Table 1 (continued) 



Correspondence between names of clone and the sequence name, and the SEQ ID. 



Name of clone 


Name of 5*-sequence 


SEQ ID 


3 Name of 3'-sequence 


SEQ ID 


HEMBB1001847 


F-HEMBB1001847 


144 


R-HEMBB1001847 


962 


HEMBB1001959 


F-HEMBB1001959 


145 


R-HEMBB1001959 


963 


HEMBB1001978 


F-HEMBB1001978 


146 


R-HEMBB1001978 


964 


HEMBB1 002039 


F-HEMBB1 002039 


147 


R-HEMBB1 002039 


965 


HEMBB1002041 


F-HEMBB1 002041 


148 


R-HEMBB1 002041 


966 


HEMBB1 002051 


F-HEMBB1 002051 


149 


R-HEMBB1 002051 


967 


HEMBB1002120 


F-HEMBB1 002120 


150 


R-HEMBB1002120 


968 


HEMBB1002162 


F-HEMBB1002162 


151 


R-HEMBB1002162 


969 


HEMBB1 002228 


F-HEMBB1 002228 


152 


R-HEMBB1 002228 


970 


HEMBB1 002245 


F-HEMBB1 002245 


153 


R-HEMBB1 002245 


971 


HEMBB1002302 


F-HEMBB1 002302 


154 


R-HEMBB1 002302 


972 


HEMBB1 002427 


F-HEMBB1 002427 


155 


R-HEMBB1 002427 


973 


HEMBB1 002465 


F-HEMBB1 002465 


156 


R-HEMBB1 002465 


974 


HEMBB1 002661 


F-HEMBB1 002661 


157 


R-HEMBB1 002661 


975 


HEMBB1002663 


F-HEMBB1 002663 


158 


R-HEMBB1 002663 


976 


HEMBB1002693 


F-HEMBB1 002693 


159 


R-HEMBB1 002693 


977 


MAMMA 1000046 


F-MAMMA1 000046 


160 


R-MAMMA1 000046 


978 


MAMMA1000102 


F-MAMMA1000102 


161 


R-MAMMA1000102 


979 


MAMMA 1000106 


F-MAMMA1000106 


162 


R-MAMMA1000106 


980 


MAMMA1000118 


F-MAMMA1000118 


163 


R-MAMMA1000118 


981 


MAMMA1000141 


F-MAMMA1000141 


164 


R-MAMMA1000141 


982 


MAMMA1 000204 


F-MAMMA1 000204 


165 


R-MAMMA1 000204 


983 


MAMMA1 000226 


F-MAMMA1 000226 


166 


R-MAMMA1 000226 


984 


MAMMA1 000403 


F-MAMMA1 000403 


167 


R-MAMMA1 000403 


985 


MAMMA1 000449 


F-MAMMA1 000449 


168 


R-MAMMA1 000449 


986 


MAMMA1 000457 


F-MAMMA1 000457 


169 


R-MAMMA1 000457 


987 


MAMMA1 000473 


F-MAMMA1 000473 


170 


R-MAMMA1 000473 


988 


MAMMA1 000496 


F-MAMMA1 000496 


171 


R-MAMMA1 000496 


989 


MAM MA1 000528 


F-MAMMA1 000528 


172 


R-MAMMA1000528 


990 


MAMMA1 000591 


F-MAMMA1 000591 


173 


R-MAMMA1000591 


991 


MAMMA 1000614 


F-MAMMA1000614 


174 


R-MAMMA1000614 


992 


MAM MA1 000652 


F-MAMMA1 000652 


175 


R-MAMMA1 000652 


993 


MAMMA1 000681 


F-MAMMA1 000681 


176 


R-MAMMA1 000681 


994 


MAMMA 1000706 


F-MAMMA1 000706 


177 


R-MAMMA1 000706 


995 


MAMMA 1000788 


F-MAMMA1 000788 


178 


R-MAMMA1 000788 


996 


MAMMA 1000810 


F-MAMMA1 000810 


179 


R-MAMMA1 00081 0 


997 


MAMMA1000814 


F-MAMMA1000814 


180 


R-MAMMA1000814 


998 


MAMMA1 000881 


F-MAMMA1 000881 


181 


R-MAMMA1 000881 


999 


MAM MA1 000986 


F-MAMMA1000986 


182 


R-MAMMA1 000986 


1000 


MAMMA 1000994 


F-MAMMA1 000994 


183 


R-MAMMA1 000994 


1001 


MAMMA1001043 


F-MAMMA1001043 


184 


R-MAMMA1001043 


1002 


MAMMA1001066 


F-MAMMA1001066 


185 


R-MAMMA1001066 


1003 


MAM MA1 001 094 


F-MAMMA1001094 


186 


R-MAMMA1001094 


1004 


MAMMA1001141 


F-MAMMA1001141 


187 


R-MAMMA1001141 


1005 


MAMMA1001150 


F-MAMMA1001150 


188 


R-MAMMA1001150 


1006 


MAMMA1001237 


F-MAMMA1001237 


189 


R-MAMMA1 001237 


1007 


MAMMA1001284 


F-MAMMA1001284 


190 


R-MAMMA1 001284 


1008 


MAMMA1001310 


F-MAMMA1001310 


191 


R-MAMMA1001310 


1009 
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Table 1 (continued) 



J Correspondence between names of clone and the sequence name, and the SEQ ID. 


i Name oi clone 


Name of 5 -sequence 


| SEQ ID 


3 Name of 3'-sequence 


| SEQ ID 


MAMMA1001344 


F-MAMMA1001344 


192 






MAMMA1001418 


F-MAMMA1001418 


193 


R-MAMMA1001418 


1010 


I 11 A till A 4 AAJ rnn 

MAM MA1 001 532 


F-MAMMA1001532 


194 


R-MAMMA1001532 


1011 


MAMMA1001609 


| F-MAMMA1001609 


195 


R-MAMMA1001609 


1012 


MAMMA 1001615 


F-MAMMA1001615 


196 


R-MAMMA1001615 


1013 


MAMMA 1001623 


F-MAMMA1001623 


197 


R-MAMMA1001623 


1014 


MAMMA1 001634 


F-MAMMA1001634 


198 


R-MAMMA1001634 


1015 


MAMMA 1001893 


F-MAMMA1001893 


199 


R-MAMMA1001893 


1016 


1 AAAAaLAAJ A ^ J s% a ^ 

MAMMA1001901 


F-MAMMA1001901 


200 


R-MAMMA1001901 


1017 


MAMMA 1001957 


F-MAMMA1001957 


201 


R-MAMMA1001957 


1018 


MAMMA1001978 


I F-MAMMA1001978 


202 


R-MAMMA1001978 


1019 


1 AAAAAAJA-J^ i-v /-\. — » j-^ 

M AM MA1 002070 


F-MAMMA1 002070 


203 


R-MAMMA1 002070 


1020 


I AAAAAAJA^ *v A /-v « 

MAMMA1 002080 


| F-MAMMA1 002080 


204 


j R-MAMMA1 002080 


1021 


MAMMA 1002087 


F-MAMMA1 002087 


205 


R-MAMMA1 002087 


1022 


I AfAAAlvJAJSt #\^\/\ a j 

MAMMA1 002091 


F-MAMMA1 002091 


206 






1 AAAAJAAA J ^\ a A A/\ r— 

MAMMA 1002095 


F-MAMMA1 002095 


207 


R-MAMMA1 002095 


1023 


MAMMA 1002128 


F-MAMMA1002128 


208 


R-MAMMA1002128 


1024 


MAM MA1 0021 42 


F-MAMMA1002142 


209 


R-MAMMA1 002142 


1025 


MAMMA1002165 


F-MAMMA1002165 


210 


R-MAMMA1002165 


1026 


MAMMA 1002205 


F-MAMMA1 002205 


211 


R-MAMMA1 002205 


1027 


MAMMA 1002224 


F-MAMMA1 002224 


212 


' R-MAMMA1 002224 


1028 


MAM MA1 002234 


F-MAMMA1 002234 


213 


R-MAMMA1 002234 


1029 


MAM MA1 002586 


F-MAMMA1 002586 


214 


R-MAMMA1 002586 


1030 


MAMMA1 002633 


F-MAMMA1 002633 


1 215 


R-MAMMA1 002633 


1031 


MAM MA1 0031 26 


F-MAMMA1003126 


216 


R-MAMMA1 003126 


1032 


NT2RM 1000407 


F-NT2RM1 000407 


217 






NT2RM 1000462 


F-NT2RM1 000462 


218 






NT2RM 1000542 


F-NT2RM1 000542 


219 






NT2RM 1000580 


F-NT2RM1 000580 


220 






NT2RM 1000789 


F-NT2RM1 000789 


221 






NT2RM 1000855 


F-NT2RM1 000855 


222 






NT2RM 1000858 


F-NT2RM1 000858 


223 






NT2RM 1000899 


F-NT2RM1 000899 


224 






NT2RM2000241 i 


F-NT2RM2000241 


225 






NT2RM2000306 


F-NT2RM2000306 j 


226 






NT2RM2000410 


F-NT2RM2000410 


227 






NT2RM2000423 


F-NT2RM2000423 


228 






NT2RM2000497 


F-NT2RM2000497 


229 






NT2RM2000514 


F-NT2RM2000514 j 


230 






NT2RM2000565 


F-NT2 RM2000565 

■ * V 9 A—- 1 II WW AbWVwWW I 


£-0 I 1 






NT2RM2000582 


F-NT2RM2000582 


232 






NT2RM2000589 


F-NT2RM2000589 


233 






NT2RM2000622 


F-NT2RM2000622 


234 . 






NT2RM2000632 


F-NT2RM2000632 


235 






NT2RM2000773 


F-NT2RM2000773 


236 






NT2RM2001126 


F-NT2RM2001126 


237 






NT2RM2001558 


F-NT2RM2001558 


238 






| NT2RM2001626 | 


F-NT2RM2001626 | 


239 | 
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Table 1 (continued) 



Correspondence between names of clone and the sequence name, and the SEQ ID. 



Name of clone 


Name of 5'-sequence 


SEQ ID 


3 Name of S'-sequence 


SEQ ID 


NT2RM2001643 


F-NT2RM2001643 


240 






NT2RM2001738 


F-NT2RM2001738 


241 






NT2RM2001767 


F-NT2RM2001767 


242 






NT2RM2001792 


F-NT2RM2001792 


243 






NT2RM2001818 


F-NT2RM2001818 


244 






NT2RM2001902 


F-NT2RM2001902 


245 






NT2RM2001939 


F-NT2RM2001939 


246 






NT2RM2001941 


F-NT2RM2001941 


247 






NT2RM4000100 


F-NT2RM4000100 


248 


R-NT2RM4000100 


1033 


NT2RM4000115 


F-NT2RM4000115 


249 


R-NT2RM4000115 


1034 


NT2RM4000198 


F-NT2RM4000198 


250 


R-NT2RM4000198 


1035 


NT2RM4000284 


F-NT2RM4000284 


251 


R-NT2RM4000284 


1036 


NT2RM4000295 


F-NT2RM4000295 


252 


R-NT2RM4000295 


1037 


NT2RM4000326 


F-NT2RM4000326 


253 


R-NT2RM4000326 


1038 


NT2RM4000417 


F-NT2RM4000417 


254 


R-NT2RM4000417 


1039 


NT2RM4000444 


F-NT2RM4000444 


255 


R-NT2RM4000444 


1040 


NT2RM4000587 


F-NT2RM4000587 


256 


R-NT2RM4000587 


1041 


NT2RM4000593 


F-NT2RM4000593 


257 


R-NT2RM4000593 


1042 


NT2RM4000648 


F-NT2RM4000648 


258 


R-NT2RM4000648 


1043 


NT2RM4000761 


F-NT2RM4000761 


259 


R-NT2RM4000761 


1044 


NT2RM4000965 


F-NT2RM4000965 


260 


R-NT2RM4000965 


1045 


NT2RM4000997 


F-NT2RM4000997 


261 


R-NT2RM4000997 


1046 


NT2RM4001321 


F-NT2RM4001321 


262 


R-NT2RM4001321 


1047 


NT2RM4001325 


F-NT2RM4001325 


263 


R-NT2RM4001325 


1048 


NT2RM4001377 


F-NT2RM4001377 


264 


R-NT2RM4001377 


1049 


NT2RM4001735 


F-NT2RM4001735 


265 


R-NT2RM4001735 


1050 


NT2RM4001768 


F-NT2RM4001768 


266 


R-NT2RM4001768 


1051 


NT2RM4001843 


F-NT2RM4001843 


267 


R-NT2RM4001843 


1052 


NT2RM4002352 


F-NT2RM4002352 


268 


R-NT2RM4002352 


1053 


NT2RP1 000002 


F-NT2RP1 000002 


269 






NT2RP1 000050 


F-NT2RP1 000050 


270 






NT2RP1000181 


F-NT2RP1000181 


271 






NT2RP1 000239 


F-NT2RP1 000239 


272 






NT2RP1 000261 


F-NT2RP1 000261 


273 






NT2RP1 000271 


F-NT2RP1 000271 


274 






NT2RP1 000300 


F-NT2RP1 000300 


275 






NT2RP1 000325 


F-NT2RP1 000325 


276 






NT2RP1 000448 


F-NT2RP1 000448 


277 






NT2RP1 000465 


F-NT2RP1 000465 


278 






NT2RP1 000468 


F-NT2RP1 000468 


279 






NT2RP1000551 


F-NT2RP1 000551 


280 






NT2RP1 000579 


F-NT2RP1 000579 


281 






NT2RP1000613 


F-NT2RP1000613 


282 






NT2RP1 000679 


F-NT2RP1 000679 


283 






NT2RP1 000740 


F-NT2RP1 000740 


284 






NT2RP1 000903 


F-NT2RP1 000903 


285 






NT2RP1 000981 


F-NT2RP1 000981 


286 






NT2RP1001004 


F-NT2RP1001004 


287 
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Table 1 (continued) 



Correspondence be tween names of done and the se quence name ,nH th» jj5 »n 



Name of clone 



NT2RP1001020 
NT2RP1001031 
NT2RP1001563 
NT2RP2000092 
NT2RP2000178 
NT2RP2000240 
NT2RP2000394 
NT2RP2000447 

NT2RP2000479 
NT2RP2000514 
NT2RP2000533 
NT2RP2000610 
NT2RP2000616 
NT2RP2000649 
NT2RP2000663 
NT2RP2000694 
NT2RP2000712 
NT2RP2000739 
NT2RP2000818 
NT2RP2000903 
NT2RP2001200 
NT2RP2001223 
NT2RP2001276 
NT2RP2001388 
NT2RP2001469 
NT2RP2001480 
NT2RP2001495 
NT2RP2001514 
NT2RP2001529 
NT2RP2001538 
NT2RP2001562 
NT2RP2001662 
NT2RP2001755 
NT2RP2001769 
NT2RP2001817 
NT2RP2001878 
NT2RP2001903 
NT2RP2001915 
NT2RP2001921 
NT2RP2001948 
NT2RP2001956 
NT2RP2002015 
NT2RP2002063 
NT2RP2002188 
NT2RP2002232 
NT2RP2002304 
NT2RP2002409 
NT2RP2002510 



Name of ^sequence | SEQ ID J 3 Nam e of ^sequencTjlEoln 

288 



F-NT2RP1001020 

F-NT2RP1001031 I 289 

F-NT2RP1001563 290 

F-NT2RP2000092 291 

F-NT2RP2000178 292 

F-NT2RP2000240 293 

F-NT2RP2000394 294 

F-NT2RP2000447 295 

F-NT2RP2000479 296 

F-NT2RP2000514 297 

F-NT2RP2000533 298 

F-NT2RP2000610 299 

F-NT2RP2000616 300 

F-NT2RP2000649 301 

F-NT2RP2000663 302 

F-NT2RP2000694 303 

F-NT2RP2000712 304 

F-NT2RP2000739 305 

F-NT2RP2000818 306 

F-NT2RP2000903 307 

F-NT2RP2001200 308 

F-NT2RP2001223 309 

F-NT2RP2001276 310 

F-NT2RP2001388 311 

F-NT2RP2001469 312 

F-NT2RP2001480 313 

F-NT2RP2001495 314 

F-NT2RP2001514 315 

F-NT2RP2001529 316 

F-NT2RP2001538 317 

F-NT2RP2001562 318 

F-NT2RP2001662 319 

F-NT2RP2001755 320 

F-NT2RP2001769 321 

F-NT2RP2001817 322 

F-NT2RP2001878 323 

F-NT2RP2001903 324 

F-NT2RP2001915 325 

F-NT2RP2001921 326 

F-NT2RP2001948 327 

F-NT2RP2001956 328 

F-NT2RP2002015 329 

F-NT2RP2002063 330 

F-NT2RP2002188 331 

F-NT2RP2002232 332 

F-NT2RP2002304 333 

F-NT2RP2002409 334 

F-NT2RP2002510 | 335 



R-NT2RP2000092 I 1054 

R-NT2RP2000178 1055 

R-NT2RP2000240 1056 

R-NT2RP2000394 1057 

R-NT2RP2000447 1 058 

R-NT2RP2000479 1059 

R-NT2RP2000514 1060 

R-NT2RP2000533 1061 

R-NT2RP2000616 1062 

R-NT2RP2000649 1063 

R-NT2RP2000663 1064 

R-NT2RP2000712 1065 

R-NT2RP2000739 1066 

R-NT2RP2000818 1067 

R-NT2RP2000903 1068 

R-NT2RP2001200 1069 

R-NT2RP2001223 1070 

R-NT2RP2001276 1071 

R-NT2RP2001388 1072 

R-NT2RP2001469 1073 

R-NT2RP2001480 1074 

R-NT2RP2001495 1075 

R-NT2RP2001514 1076 

R-NT2RP2001538 1077 

R-NT2RP2001562 1078 

R-NT2RP2001662 1079 

R-NT2RP2001755 1080 

R-NT2RP2001769 1081 

R-NT2RP2001817 1082 

R-NT2RP2001878 1083 

R-NT2RP2001903 1084 

R-NT2RP2001915 1085 

R-NT2RP2001921 1086 

R-NT2RP2001948 1087 

R-NT2RP2001956 1088 

R-NT2RP2002015 1089 

R-NT2RP2002063 1090 

R-NT2RP2G02188 1091 

R-NT2RP2002232 1092 

R-nnnnnnnnnnnn 1093 

R-NT2RP2002409 1094 

R-NT2RP2002510 1095 
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Table 1 (continued) 





Correspondence between names of clone and the sequence name, and the SEQ ID. 




Name of clone 


Name of 5'-seauence 


SEQ ID 


3 Name of 3'-seouence 


SEQ ID 


5 


NTPRPP009RP7 


F-NTP RPP00PRP7 


11R 
ooo 


R-NTP R PPnftP*>P7 
n IN icnrtuutSt/ 


1 fiQfi 




NT2RP2002533 


F-NT2RP2002533 


337 


R-NT2RP2002533 


1097 




NTPRPP00PRR4 


F-NT2 R P2002564 


lift 


R-NTP RPP00P l Sfi4 
n in 1 tnr twtouH 


1 0Q8 




NTPRPP002674 


F-NT2RP2002674 


11Q 


R-NTPRPP00P674 

11 INI tl ir tUvuUf H 


1 0QQ 

1 UCJC7 


10 


NTPRPP002721 


F-NTP RPP00P7P1 

1 IN 1 ti ir tvUt / t 1 


140 


R-NTP R PP00P7P 1 

n in i t n rtuut / t i 


1 1 uU 


NTPR PP00PRP4 


F-NTP RPP00PRP4 

i in i tnitvUtOLl 


141 

OH- 1 


R-NTP R PPnnPRP4 
n in i tnrtuototH 


1101 

1 Iv 1 




NTPRPP00PQ42 

IN 1 1 1 IILUulD^L 


F-NTP RPP0QPQ4P 

1 INI tl llt.vvC.i7Ht 


14P 

OHt 


R-NTP R PP00PQ4P 
n in i tnrtwtOHt 


11 OP 




NTPRPP00PQ74 

IN 1 tnrtV_/UtC7/ H 


F-NTP RPP00PQ74 
r in 1 1 nr &w£9f h 


141 

OHO 


R-NTP RPP00PQ74 
n in i trirtuuty/t 


1101 
1 1 oo 




NTPRPP00PQ7R 


F-NTP RPP00PQ7R 
i i n it nrtwtjj / o 


144 


R-NTP RPP00PQ7R 
n in i trirtvjvtc? / o 


1104 

1 1 OH 


15 


NT9RPP00104P 

IN 1 1 1 11 tUWvJV^Tt. 

NTPRPP001138 
in i tnr cuuo i oo 


F-NTP RPP00104P 
i in i tnrtuuuv'Tt 

F-NTPRPP00111ft 

I IN 1 trirtWO 1 OO 


14 R 

OHO 

14R 

OHO 


R-NTP R PPnO104P 

(1 IN 1 tint. UUwU^rt 


11 OS 

1 1 oo 




NTPRPP0011 7Q 
In 1 Lnr^UUO 1 / 57 


F-NTP RPP0011 7Q 
• ini tnrcwo i / J? 


147 

OH/ 


R-NTPRPPftni1 7Q 
n in 1 1 nrtuvjo i / & 


11 OR 
I I oo 




ntprppooipi n 

In 1 cnrtUUOt 1 o 


f-ntprppooipi o 

t IN 1 till tOOOt 1 O 


14ft 

OHO 


R-NTPRPPnniPi n 

n IN 1 tnr 1 o 


1 1 07 
I 1 o / 




NTPRPP00110P 


f-ntprppooiiop 

i IN 1 till tUUOOUc 


14Q 

OH J 


R-NTPRPPnniinp 


1 1 0ft 
I I oo 


NTPRPP0011RQ 

In ItnrtUVOOOj 


F-NTP RPP0011RQ 


1R0 


R-NTPRPPOfillfiQ 
n in i tnrtuuoooy 


110Q 

1 1 0\7 




NTPRPPOOIIftl 
in I tnrtuuoooo 


F-NTP RPPOOIIftl 

r IN 1 t rir tUUUvJOO 


1R1 


R-NTPRPPnmiftl 
n in 1 1 nrtuuoooj 


1110 
1 I 1 o 




NTPRPP0011Q0 


F-ntprppooiiqo 

i in i tnrtuuojyu 


1RP 
out 


R-NTPRPpnniiQn 

n in i tnrtuuooyu 


1111 

1 1 1 1 




NTPRPP0014RQ 
in i t nr tuuonoa 


F-NTP RP90014RQ 
r in i tnrtuuoHoj 


1R1 


R-NTPRPPnfl14RQ 
n in itnrtuooHOJj 


111 p 

lilt 


25 


MTP R ppooir4r 


F-NTP RPP001R4R 
i in i tnrtuuojtj 


1R4 

OoH 


R-NTPRPPnniR4^ 
n in i tnr £.\J\j^zjhzj 


1111 

1 1 1 o 




MTPRPP001RQ1 
In 1 tnrtUUOjjO 


F-NTP RPPOOIRQI 

"IN 1 cnrtUUOOyO 


ooo 


R-NTPRPPnni^QI 
n in i tnr 


111/1 

1 1 1 H 




NTPRPP001RQQ 
In 1 tnrtUUOOoc 


F-NTP RPP001RQQ 


I^R 
ooo 


R-NTP RPPnni^QQ 
n in itnrtuuocyy 


11 1 R 

1 I I o 




NT9RPP001RRR 

In 1 tnrtUUOD3J 


F-NTP RPP001RRR 

r IN 1 tnr tOUOOOO 


1^7 
oo/ 


R-NTPRPPnniRR^ 
n in i tnr 


11 1 R 
1 1 1 o 


30 


NT9RPP001RR4 

IN I cnrtuUOD04 


F-NTP RPP001RR4 
r in i tnrtuuooot 


l^ft 
ooo 


R-NTPRPPnniRR4 
n in i tnrtwooooH 


1117 
ill/ 


NTPRPP001Q11 
In I tnrcUUOaO I 


F-NTP RPP001Q11 
i IM 1 tnrtUUJyO i 


I^Q 


R-NTPRPPnniQ11 
n IN ItnrcUUOyj I 


111ft 
1 I I o 




In IcnrtUUJyHU 


F-NTP RPPnmoAn 


^RO 
ooU 


R-MT9RP9nn^Qiin 

n-|N 1 tnr tUUoiJHU 


1 1 1 Q 
ilia 




mtorpoooiqro 


F-NTP RPPOO^QRO 
r-IN 1 tnr tUUJyDU 


OOl 


R-NTQRPOnniQ^ft 
n-IM ltnrtUU«jyOU 


1 1 PO 
I ItU 




ntprppoo40rq 


F- NTP R PP0040RQ 


1RP 
out 


R-NTP RPPnn4flRQ 
rl'IM ItnrtUUHUOa 


1 1 P1 
I It I 


35 


ntprppoo41 oft 

in 1 t nr tUUt i oo 


F-NTP RPP0041 Oft 
r~IN I cnrcwH t OO 


1R1 
OOO 


R-NTPRPpnn4i na 
n in i tnr tuui i uo 


11PP 
i i tt 




NTPRPP0041 41 
In 1 tnr tUUt l H 1 


F-NTP RPP0041 41 
r IN 1 cnrtvUt 1 *f 1 


1R4 

OOH 


R-NTPRPPnn4141 

n IN 1 £.rif£.\J\JH IH 1 


1 1P1 
1 1 to 




NTPRPP0041 7Q 
In I LnrtVU'r 1 / £7 


F-NTP RPPH041 7Q 
r in i tnrtUUH I / & 


1RR 
ooo 


R-NTPRPPnn41 7Q 

n IN l £.fir£.\f\JH 1 / & 


1 1P4 

I 1 tH 




IM ItnrtUU'ftUO 


F-NTorpqooaoor 

r-IM 1 tnrtUUHtUo 


ODD 


ri-IM 1 tnrtUUH^Uo 


1 1 OR 

I Ito 


40 


NT9RP9nn<47L47 


F-NT9RP9HO/1/1/17 
r-IM 1 1 nrtUU'f'W/ 


QR7 
OD/ 


n-(N I tnr tUU44^/ 


11 9R 

1 1 to 


IN 1 tnrcUUI'tyO 


F- NT9 RP900/1/1QR 
r-IM 1 tnrtUU'WyO 


OOO 


R NT9RP9nn/L4QR 
n - IN i trirtUUHH570 


1 1P7 
lit/ 




N TP R P P 0 0/1 RP4 

IN 1 tnrcUU4Jt4 


F-NTP RPP004RP4 
r in i tnrtvUHOtH 


IRQ 

OOv7 


R-NTP RPPnn4 c iP4 
n in i tnrtuuHotH 


1 1 Pft 

I 1 tO 




NTPRPP004RRR 


F-NTP RP9004RRR 
r in i trirtuufooo 


170 
o / u 


R-NTPRPPnn4R*^R 
n in i tnrtooHooo 


1 1 PQ 

1 1 157 




NTORPonn/iROR 

In 1 tnrtUU'tDUD 


F-NTP RPPOO/1ROR 
r-IN 1 tnrtUUHoUO 


171 
o/ 1 


R-NTPRPpnnziRnR 
n - iM i tnrtuuHouo 


1 1 in 

1 loo 


45 


N TO R P9 O 0/1 R/lft 
In 1 tnrtUU'tD'fO 


F-NTP RPOOO/lRAft 


179 


R NT9RPOnn4R4ft 
n IN 1 tnrtUUHD4u 


1 1 11 

II O I 




NTPRPP004R70 


F-NTP RPP004R7H 
r IN 1 tnrtUUHDi \) 


171 
O/ o 


R-NTPRPPnn4R70 
n-iN i tnrtv>UHO/u 


11 IP 
1 1 Ot 




NTPRPP0047Q4 


F-NTP R PP0047Q4 
r in 1 t nrtoon / oh 


174 

O / H 


R-NTP RPPfin47Q4 
n in i tnrtuoH/ jjh 


1 1 11 

1 1 OO 




NT2RP2004837 


F-NT2RP2004837 


375 


R-NT2RP2004837 


1134 


50 


NT2RP2004847 


F-NT2RP2004847 


376 


R-NT2RP2004847 


1135 


NT2RP2005027 


F-NT2RP2005027 


377 


R-NT2RP2005027 


1136 




NT2RP2005069 


F-NT2RP2005069 


378 


R-NT2RP2005069 


1137 




NT2RP2005163 


F-NT2RP2005163 


379 


R-NT2RP2005163 


1138 




NT2RP2005181 


F-NT2RP2005181 


380 


R-NT2RP2005181 


1139 


55 


NT2RP2005247 


F-NT2RP2005247 


381 


R-NT2RP2005247 


1140 




NT2RP2005378 


F-NT2RP2005378 


382 


R-NT2RP2005378 


1141 




NT2RP2005391 


F-NT2RP2005391 


383 


R-NT2RP2005391 


1142 



11 



EP1 130 094 A2 



Table 1 (continued) 



Correspondence between names of clone and the sequence name, and the SEQ ID. 



to 



15 



20 



25 



30 



35 



40 



45 



50 



55 



Name of clone 

IMI2RP2005425 
NT2RP2005463 
NT2RP2005514 
NT2RP2005535 
NT2RP2005541 
NT2RP2005597 
NT2RP2005632 
NT2RP2005666 
NT2RP2005774 
NT2RP2005878 
NT2RP2005883 

NT2RP2005887 
NT2RP2005941 
NT2RP2005994 
NT2RP2006004 
NT2RP2006042 
NT2RP2006092 
NT2RP2006099 
NT2RP2006134 
NT2RP2006269 
NT2RP2006512 
NT2RP3000011 
NT2RP3000022 
NT2RP3000059 
NT2RP3000063 
NT2RP3000125 
NT2RP3000148 
NT2RP3000169 
NT2RP3000171 
NT2RP3000172 
NT2RP3000201 
NT2RP3000232 
NT2RP3000304 
NT2RP3000378 
NT2RP3000427 
NT2RP3000436 
NT2RP3000444 
NT2RP3000460 
NT2RP3000481 
NT2RP3000616 
NT2RP3000645 
NT2RP3000652 
NT2RP3000676 
NT2RP3000677 
NT2RP3000721 
NT2RP3000789 
NT2RP3000818 
NT2RP3000820 



Name of 5'-sequence I SEQ ID 

KNT2RP2005425 384 

F-NT2RP2005463 385 

F-NT2RP2005514 386 

F-NT2RP2005535 387 

F-NT2RP2005541 388 

F-NT2RP2005597 389 

F-NT2RP2005632 390 

F-NT2RP2005666 391 

F-NT2RP2005774 392 

F-NT2RP2005878 393 

F-NT2RP2005883 394 

F-NT2RP2005887 395 

F-NT2RP2005941 396 

F-NT2RP2005994 397 

F-NT2RP2006004 398 

F-NT2RP2006042 399 

F-NT2RP2006092 400 

F-NT2RP2006099 401 

F-NT2RP2006134 402 

F-NT2RP2006269 403 

F-NT2RP2006512 404 

F-NT2RP3000011 405 

F-NT2RP3000022 406 

F-NT2RP3000059 407 

F-NT2RP3000063 408 

F-NT2RP3000125 409 

F-NT2RP3000148 410 

F-NT2RP3000169 411 

F-NT2RP3000171 412 

F-NT2RP30001 72 413 

F-NT2RP3000201 41 4 

F-NT2RP3000232 415 

F-NT2RP3000304 416 

F-NT2RP3000378 j 417 

F-NT2RP3000427 [ 418 

F-NT2RP3000436 41 9 

F-NT2RP3000444 420 

F-NT2RP3000460 421 

F-NT2RP3000481 422 

F-NT2RP3000616 423 

F-NT2RP3000645 424 

F-NT2RP3000652 425 

F-NT2RP3000676 426 

F-NT2RP3000677 427 

F-NT2RP3000721 428 

F-NT2RP3000789 429 

F-NT2RP3000818 430 

F-NT2RP3000820 | 431 



3 Name of 3'-sequence SEQ ID 
H-NT2RP2005425 TT43 
R-NT2RP2005463 1 1 44 

R-NT2RP2005514 11 45 

R-NT2RP2005535 1146 
R-NT2RP2005541 11 47 

R-NT2RP2005597 1143 

R-nnnnnnnnnnnn 1149 

R-NT2RP2005666 1150 

R-NT2RP2005774 11 51 

R-NT2RP2005878 11 52 

R-NT2RP2005883 11 53 

R-NT2RP2005887 11 54 

R-nnnnnnnnnnnn 1155 

R-NT2RP2005994 1156 

R-NT2RP2006004 11 57 

R-NT2RP2006042 1158 

R-NT2RP2006092 11 59 

R-NT2RP2006099 1160 

R-NT2RP2006134 1161 

R-NT2RP2006269 1162 

R-NT2RP2006512 1163 

R-NT2RP3000011 1164 

R-NT2RP3000022 1165 

R-NT2RP3000059 1166 

R-NT2RP3000063 1167 
R-nnnnnnnnnnnn j 11 68 

R-NT2RP3000148 1169 

R-NT2RP3000169 11 70 

R-NT2RP3000171 11 71 

R-NT2RP3000172 11 72 

R-NT2RP3000201 1 1 73 

R-NT2RP3000232 11 74 
R-NT2RP3000304 j 11 75 

R-NT2RP3000378 1176 



R-NT2RP3000436 I 11 77 

R-NT2RP3000444 1173 

R-NT2RP3000460 11 79 

R-NT2RP3000481 1180 

R-NT2RP3000616 1181 

R-NT2RP3000645 1182 

R-NT2RP3000652 1183 

R-NT2RP3000676 1184 

R-NT2RP3000677 1185 

R-NT2RP3000721 1186 

R-NT2RP3000789 1187 

R-NT2RP3000818 1188 
R-NT2RP3000820 ! 1189 
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Table 1 (continued) 



Correspondence between names of clone and the sequence name, and the SEQ ID. 





Name of clone 


Name of 5'-sequence 


SEQ ID 


3 Name of 3'-sequence 


SEQ ID 


5 


NT2RP3000838 


F-NT2RP3000838 


432 


R-NT2RP3000838 


1190 




NT2RP3000871 


F-NT2RP3000871 


433 


R-NT2RP3000871 


1191 




NT2RP3000907 


F-NT2RP3000907 


434 


R-NT2RP3000907 


1192 




NT2RP3000921 


F-NT2RP3000921 


435 


R-NT2RP3000921 


1193 


1 n 

lU 


NT2RP3001012 


F-NT2RP3001012 


436 


R-NT2RP3001012 


1194 


NT2RP3001044 


F-NT2RP3001044 


437 


R-NT2RP3001044 


1195 




NT2RP3001061 


F-NT2RP3001061 


438 


R-NT2RP3001061 


1196 




NT2RP3001159 


F-NT2RP3001159 


439 


R-NT2RP3001159 


1197 




NT2RP3001170 


F-NT2RP3001170 


440 


R-NT2RP3001170 


1198 


15 


NT2RP3001195 


F-NT2RP3001195 


441 


R-NT2RP3001195 


1199 




NT2RP3001240 


F-NT2RP3001240 


442 


R-NT2RP3001240 


1200 




NT2RP3001271 


F-NT2RP3001271 


443 


R-NT2RP3001271 


1201 




NT2RP3001322 


F-NT2RP3001322 


444 


R-NT2RP3001322 


1202 


20 


NT2RP3001388 


F-NT2RP3001388 


445 






NT2RP3001542 


F-NT2RP3001542 


446 


R-NT2RP3001542 


1203 




NT2RP3001560 


F-NT2RP3001560 


447 


R-NT2RP3001560 


1204 




NT2RP3001592 


F-NT2RP3001592 


448 


R-NT2RP3001592 


1205 




NT2RP3001650 .... 


F-NT2RP3001650 


449 






25 


NT2RP3001685 


F-NT2RP3001685 


450 


R-NT2RP3001685 


1206 




NT2RP3001738 


F-NT2RP3001738 


451 


R-NT2RP3001738 


1207 




NT2RP3001754 


F-NT2RP3001754 


452 


R-NT2RP3001754 


1208 




NT2RP3001858 


F-NT2RP3001858 


453 


R-NT2RP3001858 


1209 


30 


NT2RP3001976 


F-NT2RP3001976 


454 


R-NT2RP3001976 


1210 


NT2RP3002015 


F-NT2RP3002015 


455 


R-NT2RP3002015 


1211 




NT2RP3002160 


F-NT2RP3002160 


456 


R-NT2RP3002160 


1212 




NT2RP3002281 


F-NT2RP3002281 


457 


R-NT2RP3002281 


1213 




NT2RP3002286 


F-NT2RP3002286 


458 


R-NT2RP3002286 


1214 


35 


NT2RP3002311 


F-NT2RP3002311 


459 


R-NT2RP3002311 


1215 




NT2RP3002324 


F-NT2RP3002324 


460 


R-NT2RP3002324 


1216 




NT2RP3002342 


F-NT2RP3002342 


461 


R-NT2RP3002342 


1217 




NT2RP3002353 


F-NT2RP3002353 


462 


R-NT2RP3002353 


1218 


40 


NT2RP3002409 


F-NT2RP3002409 


463 


R-NT2RP3002409 


1219 


NT2RP3002411 


F-NT2RP3002411 


464 


R-NT2RP3002411 


1220 




NT2RP3002448 


F-NT2RP3002448 


465 


R-NT2RP3002448 


1221 




NT2RP3002571 


F-NT2RP3002571 


466 


R-NT2RP3002571 


1222 




NT2RP3002664 


F-NT2RP3002664 


467 


R-NT2RP3002664 


1223 


45 


NT2RP3002721 


F-NT2RP3002721 


468 


R-NT2RP3002721 


1224 




NT2RP3002737 


F-NT2RP3002737 


469 


R-NT2RP3002737 


1225 




NT2RP3002738 


F-NT2RP3002738 


470 


R-NT2RP3002738 


1226 




NT2RP3002790 


F-NT2RP3002790 


471 


R-NT2RP3002790 


1227 


50 


NT2RP3002836 


F-NT2RP3002836 


472 


R-NT2RP3002836 


1228 


NT2RP3002887 


F-NT2RP3002887 


473 


R-NT2RP3002887 


1229 




NT2RP3002900 


F-NT2RP3002900 


474 


R-NT2RP3002900 


1230 




NT2RP3002958 


F-NT2RP3002958 


475 


R-NT2RP3002958 


1231 




NT2RP3002983 


F-NT2RP3002983 


476 


R-NT2RP3002983 


1232 


55 


NT2RP3003000 


F-NT2RP3003000 


477 


R-NT2RP3003000 


1233 




NT2RP3003076 


F-NT2RP3003076 


478 


R-NT2RP3003076 


1234 




NT2RP3003354 


F-NT2RP3003354 


479 


R-NT2RP3003354 


1235 
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Table 1 (continued) 



Correspondence between names of clone and the sequence name, and the SEQ ID. 



Name of clone 

NT2RP3003448 
NT2RP3003469 
NT2RP3003473 
NT2RP3003527 
NT2RP3003532 
NT2RP3003535 
NT2RP3003559 
NT2RP3003614 
NT2RP3003729 
NT2RP3003849 
NT2RP3003874 
NT2RP3003939 
NT2RP3003963 
NT2RP3004000 
NT2RP3004025 
NT2RP3004067 
NT2RP3004075 
NT2RP3004083 
NT2RP3004090 
NT2RP3004119 
NT2RP3004130 
NT2RP3004133 
NT2RP3004202 
NT2RP3004294 
NT2RP3004309 
NT2RP3004321 
NT2RP3004345 
NT2RP3004355 
NT2RP3004374 
NT2RP3004406 
NT2RP3004481 
NT2RP3004552 
NT2RP3004557 
NT2RP3004625 
NT2RP3004640 
NT2RP3004647 
NT2RP4000108 
NT2RP4000634 
NT2RP4000962 
NT2RP4001001 
NT2RP4001009 
NT2RP4001467 
NT2RP4001877 
NT2RP4001879 
NT2RP4002187 
NT2RP4002451 
NT2RP4002715 
NT2RP4002750 



Name of 5'-sequence SEQ ID 
F-NT2RP3003448 480 
F-NT2RP3003469 481 
F-NT2RP3003473 482 
F-NT2RP3003527 483 
F-NT2RP3003532 484 
F-NT2RP3003535 485 
F-NT2RP3003559 486 
F-NT2RP3003614 487 
F-NT2RP3003729 488 
F-NT2RP3003849 489 
F-NT2RP3003874 490 

F-NT2RP3003939 491 
F-NT2RP3003963 492 

F-NT2RP3004000 493 

F-NT2RP3004025 494 

F-NT2RP3004067 495 

F-NT2RP3004075 496 

F-NT2RP3004083 497 

F-NT2RP3004090 498 

F-NT2RP3004119 499 

F-NT2RP3004130 500 

F-NT2RP3004133 501 

F-NT2RP3004202 502 

F-NT2RP3004294 503 

F-NT2RP3004309 504 

F-NT2RP3004321 505 

F-NT2RP3004345 506 

F-NT2RP3004355 507 

F-NT2RP3004374 508 

F-NT2RP3004406 509 

F-NT2RP3004481 510 

F-NT2RP3004552 511 

F-NT2RP3004557 512 

F-NT2RP3004625 513 

F-NT2RP3004640 514 

F-NT2RP3004647 515 

F-NT2RP4000108 516 

F-NT2RP4000634 517 

F-NT2RP4000962 518 

F-NT2RP4001001 519 

F-NT2RP4001009 520 

F-NT2RP4001467 521 

F-NT2RP4001877 522 

F-NT2RP4001879 523 

F-NT2RP4002187 524 

F-NT2RP4002451 525 

F-NT2RP4002715 526 

F-NT2RP4002750 | 527 



3 Name of 3'-sequence 


SEQ ID I 


R-NT2RP3003448 


1236 


R-NT2RP3003469 


1237 


R-NT2RP3003473 


1238 


R-NT2RP3003527 


1239 


R-NT2RP3003532 


1240 


R-nnnnnnnnnnnn 


1241 


R-NT2RP3003559 


1242 


R-NT2RP3003614 


1243 


R-NT2RP3003729 


1244 


R-NT2RP3003849 


1245 


R-NT2RP3003874 


1246 



R-NT2RP3003963 
R-NT2RP3004000 
R-NT2RP3004025 

R-NT2RP3004075 

R-NT2RP3004083 

R-NT2RP3004090 

R-NT2RP3004119 

R-NT2RP3004130 

R-NT2RP3004133 

R-NT2RP3004202 

R-NT2RP3004294 

R-NT2RP3004309 

R-NT2RP3004321 

R-NT2RP3004345 

R-NT2RP3004355 

R-NT2RP3004374 

R-NT2RP3004406 

R-NT2RP3004481 

R-NT2RP3004552 



1247 
1248 
1249 

1250 

1251 

1252 

1253 

1254 

1255 

1256 

1257 

1258 

1259 

1260 

1261 

1262 

1263 

1264 

1265 



R-NT2RP3004625 


[ 1266 


R-NT2RP3004640 


1267 


R-NT2RP3004647 


1268 


R-NT2RP4000108 


1269 


R-NT2RP4000634 


1270 


R-NT2RP4000962 


1271 


R-NT2RP4001001 


1272 


R-NT2RP4001009 


1273 


R-NT2RP4001467 


1274 


R-NT2RP4001877 


1275 


R-NT2RP4001879 


1276 


R-NT2RP4002187 


1277 


R-NT2RP4002451 


1278 


R-NT2RP4002715 


1279 


R-NT2RP4002750 


1280 
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Table 1 (continued) 



Correspondence between names of clone and the sequence name, and the SEQ ID. 





Name of clone 


Name of 5'-sequence 


SEQ ID 


3 Name of S'-sequence 


SEQ ID 


5 


OVARC1 000003 


F-OVARC1 000003 


528 


R-0VARC1 000003 


1281 




OVARC1 000090 


F-OVARC1 000090 


529 


R-0VARC1 000090 


1282 




OVARC1 000105 


F-OVARC1000105 


530 


R-OVARC1000105 


1283 




OVARC1000137 


F-OVARC1000137 


531 


R-OVARC1000137 


1284 


10 


OVARC1 000208 


F-OVARC1 000208 


532 


R-OVARC1 000208 


1285 


OVARC1 000255 


F-OVARC1 000255 


533 


R-OVARC1 000255 


1286 




OVARC1 000275 


F-OVARC 1000275 


534 


R-0VARC1 000275 


1287 




OVARC1 000298 


F-OVARC 1000298 


535 


R-OVARC1 000298 


1288 




OVARC1 000307 


F-OVARC1 000307 


536 


R-OVARC1 000307 


1289 


15 


OVARC1000313 


F-OVARC1000313 


537 


R-OVARC1000313 


1290 




OVARC1 000331 


F-OVARC1 000331 


538 


R-OVARC1 000331 


1291 




OVARC 10004 10 


F-OVARC1 000410 


539 


R-OVARC1 000410 

P ■ WW » • P | w wmr *tmT T P wmr 


1292 




OVARC 1000439 


F-OVARC1 000439 


540 


R-OVARC1 000439 

P ■ VP Pi P ■ *mmW 1 X^ X^ ¥ X^ 


1293 

P *^ V X^ 


20 


OVARC1 000467 


F-OVARC1 000467 


541 


R-OVARC1 000467 


1294 


OVARC1 000529 


F-OVARC1 000529 


542 


R-OVARC1 000529 


1295 




OVARC 1000553 


F-OVARC 1000553 


543 


R-OVARC1 000553 


1296 




OVARC 1000775 


F-OVARC1 000775 


544 


R-OVARC1 000775 


1297 




OVARC1 000811 


F-OVARC1 000811 


545 


R-OVARC1 000811 

• ■ X^ WW % P P r V V 


1298 


25 


OVARC 1000853 


F-OVARC 1000853 

■ ^ i r m * » « ■ ^ ■ ' » X^ x» X^ X^X^ X^ 


546 


R-OVARC1 000853 

P P X mW WW 9> W P X^ ^ * 


1299 

P Wmmm XX V 




OVARC 1000873 


F-OVARC1 000873 


547 


R-OVARC1 000873 


1300 




OVARC1000916 


F-OVARC1000916 


548 


R-OVARC1000916 


1301 




OVARC1 000956 


F-OVARC1 000956 


549 


R-OVARC1 000956 


1302 


30 


OVARC1 000995 


F-OVARC1 000995 


550 


R-OVARC1 000995 


1303 


OVARC1001030 

ww \ 9 ft 1 wr X^ I wr X^ ^mr 


F-OVARC1001030 

I W w \ 9 ■ X^ V X^ W ■ Xri* X^ 


551 


R-OVARC1001030 


1304 




OVARC1001049 


F-OVARC1001049 


552 


R-OVARC1001049 


1305 




OVARC 100 1086 

W W » ■ ■ ■ X^ X^ P X* Wmw Wmw 


F-OVARC1001086 


553 


R-OVARC1001086 

■ I fill ■ 1 X^ wmf ■ »JbT 


1306 

9 W \/ Xrf 




OVARC1001132 

WW ■ ■ W P wmr x^ I p x^ m^m 


F-OVARC1001132 

■ x^ w w m ■ ■ w ■ x#» X^ ' ■ Vk 


554 


R-OVARC1001132 


1307 

* w X^ * 


35 


OVARC1001163 


F-OVARC1001163 


555 


R-OVARC1001163 


1308 




OVARC 100 1222 


F-OVARC1001222 


556 


R-OVARC1001222 


1309 




OVARC1001260 


F-OVARC1001260 

■ W W » P ■ Xl^ P X^ XJ» P **mW ^mr 


557 


R-OVARC1001260 

■ ■ ▼ # » I ■ I XX X^ ■ ■ ■ X* X^ 


1310 




OVARC1001336 

w m » w m I x* x^ ■ x^^^ x* 


F-OVARC1001336 

* W B »■ I X^^ P XX X^ ■ Np* XX %P» 


558 

\-W 


R-OVARC1001336 

P ■ V *7 1 ■ ■ 1 *rf X^ ■ 9 LmW ,w LmW X> 


1311 


40 


OVARC1001338 

w / SI ■ P x^ ■ x^ X^ X^ 


F-OVARC1001338 


559 


R-OVARC 100 1338 

P 1 » » 1 1 ■ ***** 9 '•mf X^ 1 Xrf* X^ X^ 


1312 

1 W P * — 


OVARC1001569 

w m mm m ■ ™» x» x^ x^ x^ 


F-OVARC1001569 


560 


R-OVARC1001569 

P K WW V> P ■ 1 X* X^ P X^ X^ 


1313 

1 X^ P Xj*F 




OVARC1001570 


F-OVARC1001570 


561 


R-OVARC1001570 


1314 




OVARC1001596 


F-OVARC1001596 


562 


R-OVARC1001596 


1315 




OVARC1001607 


F>OVARC1001607 


563 


R-OVARC1001607 

P m X WW VI ■ X^ P x^ X^ ■ X^ \^ r 


1316 


45 


OVARC1001725 


F-OVARC1001725 


564 


R-0VARC1 001725 

p p ~\ m * 1 ww » ■ iv p vv I r fcv 


1317 

l x^ t W 




OVARC1001727 


F-OVARC1001727 


565 


R-OVARC1001727 


1318 

P Xr W w 




OVARC1001807 


F-OVARC1001807 


566 


R-0VARC1 001807 


1319 




OVARC1001833 


F-OVARC1001833 


567 


R-OVARC1001833 


1320 


50 


OVARC1001952 


F-OVARC1001952 


568 






OVARC1001991 


F-OVARC1001991 


569 


R-OVARC1 001991 


1321 




OVARC1 002058 


F-OVARC1 002058 


570 


R-0VARC1 002058 


1322 




OVARC1002178 


F-OVARC1002178 


571 


R-OVARC1002178 


1323 




PLACE 1000033 


F-PLACE1 000033 


572 


R-PLACE1 000033 


1324 


55 


PLACE 1000231 


F-PLACE1 000231 


573 


R-PLACE1 000231 


1325 




PLACE 1000258 


F-PLACE1 000258 


574 


R-PLACE1 000258 


1326 




PLACE 1000442 


F-PLACE1 000442 


575 


R-PLACE1 000442 


1327 
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Correspondence 
Name of clone 



Table 1 (continued) 



PLACE 1000560 
PLACE 1000740 
PLACE 1000907 
PLACE 1000912 
PLACE 1000914 
PLACE 1000927 
PLACE 1000986 
PLACE1001016 
PLACE 1001100 
PLACE 1001114 
PLACE 1001123 
PLACE 1 001 1 83 
PLACE 1001229 
PLACE 1001231 
PLACE 1001340 
PLACE1001401 
PLACE 1001407 
PLACE 1001464 
PLACE1001500 
PLACE1001516 
PLACE1001536 
PLACE 1001564 
PLACE 1001655 
PLACE 1001788 
PLACE1001795 
PLACE1001836 
PLACE 1001918 
PLACE 1001949 
PLACE 1002080 
PLACE 1002095 
PLACE 1002153 
PLACE 1002329 
PLACE 1002355 
PLACE1 002374 
PLACE 1002518 
PLACE 1002547 
PLACE 1002726 
PLACE 1002905 
PLACE 1002911 
PLACE 1002967 
PLACE 1003135 
PLACE1003163 
PLACE 1003407 
PLACE 1003428 
PLACE1 003438 
PLACE 1003460 
PLACE 1003529 
PLACE 1003573 



between names of clone and the sequence name, and the SEQ ID. 



Name of 5'-sequence I SEQ ID 



F-PLACE1 000560 
F-PLACE1 000740 
F-PLACE1 000907 
F-PLACE1 000912 
F-PLACE1000914 
F-PLACE1 000927 
F-PLACE1 000986 
F-PLACE1001016 
F-PLACE1001100 
F-PLACE1001114 
F-PLACE1001123 
F-PLACE1001183 
F-PLACE1001229 
F-PLACE1001231 
F-PLACE1001340 
F-PLACE1 001401 
F-PLACE1001407 
F-PLACE1001464 
F-PLACE1001500 
F-PLACE1001516 
F- PLACE 1001536 
F-PLACE1001564 
F-PLACE1001655 
F-PLACE1001788 
F-PLACE1001795 
F-PLACE1001836 
F-PLACE1001918 
F-PLACE1001949 
F-PLACE1 002080 
F-PLACE1 002095 
F-PLACE1002153 
F-PLACE1 002329 
F-PLACE1 002355 
F-PLACE1 002374 
F-PLACE1002518 
F-PLACE1 002547 
F-PLACE1 002726 
F-PLACE1 002905 
F-PLACE1 002911 
F-PLACE1 002967 
F-PLACE1003135 
F-PLACE1003163 
F-PLACE1 003407 
F-PLACE1 003428 
F-PLACE1 003438 
F-PLACE1 003460 
F-PLACE1 003529 
F-PLACE1 003573 



576 
577 
578 
579 
580 
581 
582 
583 
584 
585 
586 
587 
588 
589 
590 
591 
592 
593 
594 
595 
596 
597 
598 
599 
600 
601 
602 
603 
604 
605 
606 
607 
608 
609 
610 
611 
612 
613 
614 
615 
616 
617 
618 
619 
620 
621 
622 
623 



3 Name of 3'-sequence I SEQ ID 



R-PLACE1 000560 I 1328 

R-PLACE 1000740 1329 

R-PLACE1000912 1330 

R-PLACE1 000914 1331 

R-PLACE1 000927 1332 

R-PLACE1 000986 1333 

R-PLACE1001016 1334 

R-PLACE1001100 1335 

R-PLACE1001114 1336 

R-PLACE1001123 1337 

R-PLACE1001183 1338 

R-PLACE1001229 1339 

R-PLACE1 001231 1340 

R-PLACE1001340 1341 

R-PLACE1 001401 1342 

R-PLACE1001407 1343 

R-PLACE1001464 1344 

R-PLACE 100 1500 1345 

R-PLACE1001516 1346 

R-PLACE1001536 1347 

R-PLACE1001564 1348 

R-PLACE1001655 1349 

R-PLACE1001788 1350 

R-PLACE1001795 1351 

R-PLACE1001836 1352 

R-PLACE1001918 1353 

R-PLACE1001949 1354 

R-PLACE1002080 1355 

R-PLACE1 002095 1356 

R-PLACE1002153 1357 

R-PLACE1 002329 1358 

R-PLACE1 002355 1359 

R-PLACE 1002374 1360 

R-PLACE 1002518 1361 

R-PLACE1 002547 1362 

R-PLACE1 002726 1363 

R-PLACE 1002905 1364 

R-PLACE1 002911 1365 

R-PLACE1 002967 1366 

R-PLACE1003135 1367 

R-PLACE1003163 1368 

R-PLACE1 003407 1369 

R-PLACE1 003428 1370 

RPLACE1 003438 1371 

R-PLACE1 003460 1372 

R-nnnnnnnnnnnn 1373 

R-PLACE 1003573 1374 
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Table 1 (continued) 

Correspondence between names of clone and the sequence name, and the SEQ ID. 



Name of clone 


Name of 5'-sequence 


SEQ ID 


3 Name of 3'-sequence 


fk r— /—v ifk 

SEQ ID 


^^x 4 A dt^^ -J ax ax ^x j ax ax 

PLACE 1003598 


ft^™ ■ A A^^Lm J ^A AX AX mW** AX A 

F-PLACE1 003598 


624 


R-PLACE1 003598 


1375 


f*X i A a^ f™ - J a yx. A *V A J 

PLACE 1003644 


F- PLACE 1003644 


625 


m • x #-x i a a«x V— a a a -x ax A A 

R-PLACE1 003644 


1376 


PLACE 1003737 


F-PLACE1 003737 


626 


rx ph | a /\ tr-m A AX A AX -Xa 

R-PLACE1 003737 


1377 


PLACE 1003772 


F- PLACE 1003772 


627 


• X f-—\ | A A^X f A AX AX ^X mmm %> AX 

R-PLACE 1003772 


1378 


PLACE 1003839 


F-PLACE1 003839 


628 


f^X ft A A A A #^ ax ^x /\ 

R-PLACE1 003839 


1379 


1^ 1 A A^ ^ ma ~ J —x kX, ^k ^X 4 

PLACE 1003845 


1 A ^X ft^* -1 AX AX AX A J V 

F-PLACE1 003845 


629 


■ A ABX. ^ mmm * A A A ^X «X A A^ 

R-PLACE1 003845 


1380 


PLACE 1 003 852 


1 A 1^" _J AX A AX AX, ^A A 

F-PLACE1 003852 


630 


PX | A A****. r~ -ft A% A AX AX A 

R-PLACE1 003852 


1381 


PLACE 1 004028 


1 A S~\ t— A AX AX A AX ^X *x 

F-PLACE1 004028 


631 


■ i x ft"x a a a 'x A^Am* « • >k 

R-PLACE 1004028 


1382 


PLACE 1 004078 


l"" f~\ 1 A /% |~~ A ^X yx a n — — » /x 

F-PLACE1 004078 


632 


« X ft^X fl A A— x f - ™* _i aX ax A ax » AX 

R-PLACE1 004078 


1383 


PLACE1004166 


F-PLACE1004166 


633 


f— x. f \ | a r - ~ A ax ^x A A ax a 

R-PLACE1004166 


1384 


PLACE 1004168 


F-PLACE1004168 


634 


R-nnnnnnnnnnnn 


1385 


r*x ■ m a«x\ r™ ' _# ax .^s. a * <pv 

PLACE1004199 


t V I A ft^~ -ft «X A A . *x AX 

F-PLACE1004199 


635 


^^fc. ^*X 1 A A A AX A A AX AX 

R-PLACE1004199 


1386 


PLACE 1004279 


^^X ft A * m X A A A AX 

F-PLACE1 004279 


636 


R-PLACE1 004279 


1387 


PLACE 1004282 


F-PLACE1 004282 


637 


R-PLACE1 004282 


1388 


PLACE 1004305 


1 A A*X J AA AX A ^X A tr" 

F-PLACE1 004305 


638 


R-PLACE1 004305 


1389 


PLACE 1004441 


M— J-~S. 1 A ^^X f _j ^x AX A A A _J 

F-PLACE1 004441 


639 


R-PLACE1 004441 


1390 


PLACE 1004450 


ft^ - ' ^^X ft A J^X. A AX AX A A A"" AX 

F-PLACE1 004450 


640 


R-PLACE1 004450 


1391 


PLACE 1004482 


F-PLACE1 004482 


641 


R-PLACE1 004482 


1392 


a A A"X J a m A A -*"k ax 

PLACE 1004492 


F-PLACE1 004492 


642 


R-PLACE1 004492 


1393 


PLACE 1004519 


F-PLACE1004519 


643 


R-PLACE1004519 


1394 


PLACE 1004520 


m mmmr | A A AX AX ^ » ^k-, ^x 

F- PLACE 1004520 


644 


R-PLACE1 004520 


1395 


PLACE 1 004630 


g X | A A^X ft^~ A A A A AA ^X 

F- PLACE 1004630 


AX A §*-* 

645 


R-PLACE1 004630 


1396 


•"■ft ft A a«*i ■ _j a AX A *x 

PLACE 1004637 


MW maam A^ mm M m A A AX At AX AX 

F- PLACE 1004637 


646 


R-PLACE1 004637 


1397 


PLACE 1004648 


F- PLACE 1004648 


647 


R-PLACE1 004648 


1398 


PLACE 1004816 


F- PLACE 10048 16 


648 


R-PLACE1004816 


1399 


PLACE 1004887 


F-PLACE1 004887 


649 


R-PLACE1 004887 


1400 


PLACE 1005003 


F-PLACE1 005003 


650 


R-PLACE1 005003 


1401 


f k ■ A rf— X pi ■ ■ A — k ^k >X AX 

PLACE 1005005 


F-PLACE1 005005 


651 


■ k k — k ■ k jk Mmmmr a A — — «» 

R-PLACE1 005005 


1402 


■ V ■ A a^x f I d *x A ^— ■ 

PLACE 1005031 


F- PLACE 1005031 


652 


R-PLACE1 005031 


1403 


PLACE 1005239 


F- PLACE 1005239 


653 


R-PLACE1 005239 


1404 


PLACE 1005250 


F-PLACE1 005250 


654 


R-PLACE1 005250 


1405 


■ A ATX ^mmm d mm — — fc — 

PLACE 1005383 


F-PLACE1 005383 


655 


R-PLACE1 005383 


1406 


ft A ft^™ _ft A ax. ^™ A A A 

PLACE 1005410 


■ A x AX A*-* A J AX 

F-PLACE1 005410 


656 


m \. Ammmm. B k .mmm* m am*. ~ A»mm k ■ 

R-PLACE1 005410 


1407 


I^N 1 A A^*| ft^ - J A A A — ik 

PLACE 1005426 


| X \ mmmm -At AX AX ^» A AX AX 

F-PLACE1 005426 


657 


R-PLACE1 005426 


1408 


f V i A ^X J AA, A J Al 

PLACE 1005519 


1 A A X m mmmm " _J ^X AX a— _J 

F-PLACE1005519 


658 


J X ^-tm. ft A A Wmamt M, Amm. A «^ A» A AX 

R-PLACE1005519 


1409 


PLACE 1005539 


F-PLACE1 005539 


659 


R-PLACE1 005539 


1410 


PLACE 1005544 


F-PLACE1 005544 


660 


R-PLACE1 005544 


1411 


PLACE 1005569 


F-PLACE1 005569 


661 


R-PLACE1 005569 


1412 


ft A <*t J A A A A A 

PLACE 1005601 


t^-** ft A A***. ' mmaU> -ft AX A A\ a mW AX AX J 

F-PLACE1 005601 


662 


R-PLACE1 005601 


1413 


PLACE 1005660 


F-PLACE1 005660 


663 


R-PLACE1 005660 


1414 


PLACE 1005669 


F- PLACE 1005669 


664 


R-PLACE1 005669 


1415 


PLACE 1005682 


F-PLACE1 005682 


665 


R-PLACE1 005682 


1416 


PLACE 1005725 


F-PLACE1 005725 


666 


R-PLACE1 005725 


1417 


PLACE 1005736 


F-PLACE1 005736 


667 


R-PLACE1 005736 


1418 


PLACE 1005745 


F-PLACE1 005745 


668 


R-PLACE1 005745 


1419 


PLACE 1005768 


F-PLACE1 005768 


669 


R-PLACE1 005768 


1420 


PLACE1 005815 


F-PLACE1005815 


670 


R-PLACE1005815 


1421 


PLACE 1005878 


F-PLACE1 005878 


671 


R-PLACE1 005878 


1422 
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Table 1 (continued) 



Correspondence 

Name of clone 
PLACE 1005927 
PLACE 1006071 
PLACE 1006073 
PLACE 1006079 
PLACE 1006093 
PLACE 1006208 
PLACE 1006219 
PLACE 1006277 
PLACE 1006290 
PLACE 1006443 
PLACE 1006515 
PLACE 1006716 
PLACE 1006786 
PLACE 1006809 
PLACE 1006959 
PLACE 1007028 
PLACE 1007040 
PLACE 1 007077 
PLACE 1007081 
PLACE 1007096 
PLACE 1007296 
PLACE 1007591 
PUVCE 1007626 
PLACE 1007702 
PLACE1 007845 
PLACE1 007881 
PLACE 1007971 
PLACE 1008282 
PLACE 1008297 
PLACE 1008359 
PLACE 1008469 
PLACE 1008549 
PLACE 1008657 
PLACE 1008716 
PLACE 1008744 
PLACE 1008984 
PLACE 1008985 
PLACE 1009067 
PLACE 1009196 
PLACE 1009279 
PLACE 1009527 
PLACE 1009546 
PLACE 1009600 
PLACE 1009735 
PLACE 1009982 
PLACE1 010011 
PLACE1010078 
PLACE1010081 



between names of done and the sequence name, and the SEQ ID. 



Name of 5'-sequence [ SEQ ID 

F-PLACE1 005927 672 

F-PLACE1 006071 673 

F-PLACE1 006073 674 

F-PLACE1 006079 675 

F-PLACE1 006093 676 

F-PLACE1 006208 677 

F-PLACE1006219 678 

F-PLACE1 006277 679 

F-PLACE1 006290 680 

F-PLACE1 006443 681 

F-PLACE1 006515 682 

F-PLACE1006716 683 

F-PLACE1 006786 684 

F-PLACE1 006809 685 

F-PLACE1 006959 686 

F-PLACE1 007028 687 

F-PLACE1 007040 688 

F-PLACE1 007077 689 

F-PLACE1 007081 690 

F-PLACE1 007096 691 

F-PLACE1 007296 692 

F-PLACE1 007591 693 

F-PLACE1 007626 694 

F-PLACE1 007702 695 

F-PLACE1 007845 696 

F-PLACE1 007881 697 

F-PLACE1 007971 698 

F-PLACE1 008282 699 

F-PLACE1 008297 700 

F-PLACE1 008359 701 

F-PLACE1 008469 702 

F-PLACE1 008549 703 

F-PLACE1 008657 704 

F-PLACE1008716 705 

F-PLACE1 008744 706 

F-PLACE1 008984 707 

F-PLACE1 008985 708 

F-PLACE1 009067 709 

F-PLACE1009196 710 

F-PLACE1 009279 711 

F-PLACE1 009527 712 

F-PLACE1 009546 713 

F-PLACE1 009600 j 714 

F-PLACE1 009735 715 

F-PLACE1 009982 716 

F-PLACE1010011 717 

F-PLACE1010078 718 

F-PLACE1010081 | 719 



3 Name of 3*-sequence I SEQ ID 

R-PLACE1 005927 1423 

R-PLACE1 006071 j 1424 

R-PLACE1 006073 1425 

R-PLACE1 006079 1426 

R-PLACE1 006093 1427 

R-nnnnnnnnnnnn 1428 

R-PLACE1006219 1429 

R-PLACE1 006277 1430 

R-PLACE1 006290 1431 

R-PLACE1 006443 1432 

R-PLACE1 006515 1433 

R-PLACE1006716 1434 

R-PLACE1 006786 1435 

R-PLACE1 006809 1436 

R-PLACE1 006959 1437 

R-PLACE1 007028 1438 

R-PLACE1 007040 1439 

R-PLACE 1007077 1440 

R-PLACE1 007081 1441 

R-PLACE1 007096 1442 

R-PLACE1 007296 1443 

R-PLACE1 007591 1444 

R-PLACE1 007626 1445 

R-PLACE1 007702 1446 

R-PLACE1 007845 1447 

R-PLACE1 007881 1448 

R-PLACE1 007971 1449 

R-PLACE1 008282 1450 

R-PLACE1 008297 1451 

R-PLACE1 008359 1452 

R-PLACE1 008469 1453 

R-PLACE1 008549 1454 

R-PLACE1 008657 1455 

R-PLACE1008716 1456 

R-PLACE1 008744 1457 

R-PLACE1 008984 1458 

R-PLACE1 008985 1459 

R-PLACE1 009067 1460 

R-PLACE1009196 1461 

R-PLACE1 009279 1462 

R-PLACE1009527 1463 

R-PLACE1 009546 1464 

R-PLACE1 009600 1465 

R-PLACE1 009735 1466 

R-nnnnnnnnnnnn 1467 

R-PLACE1010011 1468 

R-PLACE1010078 1469 
R-PLACE1010081 I 1470 
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Table 1 (continued) 



Correspondence between names of clone and the sequence name, and the SEQ ID. 





Name of clone 


Name of 5' -sequence 


SEQ ID 


3 Name of 3*-sequence 


SEQ ID 


5 


PLACE 1010251 


F-PLACE1010251 


720 


R-PLACE1010251 


1471 




PLACE1010445 


F-PLACE1010445 


721 


R-PLACE1010445 


1472 




PLACE 1010713 


F-PLACE1010713 


722 


R-PLACE1010713 


1473 




PLACE 1010784 


F-PLACE1010784 


723 


R-PLACE1010784 


1474 


10 


PLACE 1010827 


F-PLACE1010827 


724 


R-PLACE1010827 


1475 


PLACE 1010968 


F-PLACE1010968 


725 


R-PLACE1010968 


1476 




PLACE 1011045 


F-PLACE1011045 


726 


R-PLACE1011045 


1477 




PLACE1011116 


F-PLACE1011116 


727 


R-PLACE1011116 


1478 




PLACE1011181 


F-PLACE1011181 


728 






15 


PLACE1011236 


F-PLACE1011236 


729 


R-PLACE1011236 


1479 




PLACE 1011364 


F-PLACE1011364 


730 


R-PLACE1011364 


1480 




PLACE 1 011407 


F-PLACE1011407 

w w ■ ■ wmmm 9 9 w ■ ^mr W 


731 


R-PLACE1011407 

» ■ 9mmmmm W ^mr 9mmm 9 9 9 9 mjmr * 


1481 




PLACE1011516 


F-PLACE1011516 


732 


R-PLACE1011516 


1482 


20 


PLACE 1011708 


F-PLACE1011708 


733 


R-PLACE1011708 


1483 


PLACE 1011824 


F-PLACE1011824 


734 


R-PLACE1011824 


1484 




PLACE 1011978 


F-PLACE1011978 


735 


R-PLACE1011978 


1485 




PLACE2000118 


F-PLACE2000118 


736 


R-PLACE2000118 


1486 




PLACE2000219 


F-PLACE2000219 


737 


R-PLACE2000219 

9 W 9 V^_V 1 V^_^_ ^mr ^mr ^mr V^_ 1 


1487 


25 


PLACE3000181 


F-PLACE3000181 


738 


R-PLACE3000181 


1488 




PLACE3000213 


F-PLACE3000213 


739 


R-PLACE3000213 


1489 

9 T **** 




PLACE4000354 


F-PLACE4000354 


740 


R-PLACE4000354 


1490 




PLACE4000455 


F-PLACE4000455 


741 


R-PLACE4000455 


1491 


30 


SKNMC1 000004 


F-SKNMC 1000004 


742 






SKNMC1000014 
SKNMC1 000082 


F-SKNMC1000014 
F-SKNMC 1000082 


743 
744 








THYR01 000036 


F-THYR01 000036 

9 ■ 9 ■ 9 ¥ W^BW 9 ^mT ^mf *mmr *m* ^m* *mmW 


745 


R-THYROI 000036 


1492 




THY R0 1000061 


F-THYR01 000061 

■ 9 V ■ 1 B | V X^ ■ 


746 


R-THYROI 000061 

V VJ 9 9 1 V 1 ■ I N» ^m* m 4mT *m*r *+mW w 


1493 


35 


THY R0 1000099 


F-THYR01 000099 


747 


R-THYROI 000099 


1494 




THYRO1000196 


F-THYRO1000196 


748 


R-THYRO1000196 


1495 




THYR0 1000400 


F-THYR01 000400 

V V 1 ■ 9 W 9 ^mmW t ^m* *)mr ^mT f ^mW ^mr 


749 


R-THYROI 000400 

V VJ % 9 W 9 t m ^mmW 9 *mmr w ^mw T «v W 


1496 




THYROI 000580 


F-THYROI 000580 


750 


R-THYROI 000580 


1497 


40 


THYR01 000584 


F-THYR01 000584 

■ V V V ■ V V V >mf X^ S-' I 


751 


R-THYROI 000584 

V ■ » 1 V V V t ^*mW 1 X^ »^ « 


1498 


THYR0 1000678 


F-THYR01 000678 


752 


R-THYROI 000678 


1499 




THY R0 1000776 


F-THYR01 000776 


753 


R-THYROI 000776 


1500 




THYR0 1000795 


F-THYR0 1000795 


754 


R-THYROI 000795 


1501 




THYR0 1000846 


F-THYR0 1000846 

■ ■ 9 1 4 9 ■ X m* 9 *tmT m mtr ^mf 9+** T X-/ 


755 


R-THYROI 000846 


1502 

1 *»-r W V— 


45 


THYR0 1000866 

V V V » V • V >^ »^ X^ 


F-THYR0 1000866 

■ Villi! ^ m* I V 


756 


R-THYR01 000866 


1503 




THYR01 000956 


F-THYROI 000956 

■ V V 1 1 IV V X^ X^ X^ Vd* 


757 


R-THYROI 000956 


1504 




THYR0 1000964 


F-THYR01 000964 


758 


R-THYROI 000964 


1505 




THYR0 1000999 


F-THYROI 000999 


759 


R-THYROI 000999 


1506 


50 


THYRO1001063 


F-THYR0 100 1063 


760 


R-THYRO1001063 


1507 


THYRO1001071 


F-THYRO1001071 


761 


R-THYRO1001071 


1508 




THYR01001102 


F-THYRO1001102 


762 


R-THYRO1001102 


1509 




THYRO1001113 


F-THYRO1001113 


763 


R-THYRO1001113 


1510 




THYR01 001128 


F-THYRO1001128 


764 


R-THYRO1001128 


1511 


55 


THYRO1001205 


F-THYRO1001205 


765 


R-THYRO1001205 


1512 




THYRO1001237 


F-THYR0 100 1237 


766 


R-THYRO1001237 


1513 




THYR0 1001 242 


F-THYRO1001242 


767 


R-THYRO1001242 


1514 
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Table 1 (continued) 



Correspondence between names of clone and the sequence name, and the SEQ ID. 



Name of clone 


Name of 5'-seoLience 


sfo in 


Q Mama Q* mm mn^n 

«j iMdjfitj 01 0 -secjuence 


oca in 


THYR01 001 Pfifi 


P-TWYRrii nm 9rr 
r i n t rSKJ i uu i <iod 


/bo 


H- 1 HYRO 1 00 1 266 


1515 


TH YR0 1 00 1 397 


F.THYRH1 nm ^97 
i l n i nkJ 1 UU 1 o£/ 


7CQ 

/oy 


hi- 1 HYHOi 001 327 


4 r~ 4 A 

1516 


THYRO100145R 


F-THYR01 nniA^R 
r mi r\\j i uu i hod 


77n 
/ /U 


H- 1 HYnO 1001456 


4^4 — r 

1517 


THYR0 1001457 


r i n i nu i uu i *to / 


771 
/ /l 


! n- 1 nYHO1001457 


4 f~ 4 A 

1518 


THYRO1001471 

■ ■III IV-/ 1 V/Vs It/ 1 


F-THYRO.1 00 1471 
i i n t r\KJ i uu 1 h f I 


770 
lid 


D TUVDAH AAH A~r + 

H-l HYnO 1 001471 


^ r~ 4 a 

1519 


THYRO100147R 

1 1 1 1 i ivy 1 W 1"+/ o 


F-TMYRn 1 nn 1 a~7r 


I/O 


p> TUN/OA 4 aa4 jnn 

I H-T HYR0 1001 478 


4 f - AA 

1520 


THYRO1Q014Q5 


F-THYR01 001/1Q(; 
r i n t I uu I *tyo 


f /4 


n- 1 HYHOI 001 495 


4 A 4 

1521 


THYROI 001^9*3 


F-TMYRH1 nm koq 
i i n t nw i uu i o^o 


/ /O 


n-THYROl 001 523 


1522 


THYRO10015PQ 


F-THYRH1 nm wq 
r mi nU J UU I D<i y 


/ /D 


R-THYR0 1001529 


1523 


THYRO10015Q3 


• i n i nu i uu 1 oyo 


777 
Iff 


D TUVDA <t aa 4 rno 

R- THYROI 001593 


4 PA 4 

1524 


THYRO1Q01608 


F-TH Y RD 1 00 1 ftHQ 
r i n i i uu I duo 


77Q 
/ /O 


R- 1 HYROI 001 608 


4 r r» r 

1525 


THYROI 001 R41 

1 1 1 1 1 i\S 1 UU 1 U*+ 1 


F.THVRH1 nm ft/f i 
I'lnT r\\Jl uul D^H 


770 

/ /9 


D TTI_I V/ n/"^ 4 /"\ A\ 4 0 4 4 

R-THYR01 001 641 


1526 


THYROI 001 70O 


F.tuv dah nn 1 7nn 
r* 1 rl T rlVJ 1 UU 1 / UU 


/OU 


R-TH YR0 1001 700 


1527 


THYROI 001 709 


F-TMYRfM nm vno 
r-lnT riLJl UU1 /U^ 


/Ol 


R-THYR01 001 702 


1528 


THYROI 001 79R 


F-THYR01 nni TOC 
r*inT nui uu 1 /^o 


TOO 


O TUVF>A< 7r>r 

R-THYRO 1 001 725 


1529 


THYRO1001770 


F-TH Y rhi nn 1 77r» 
I'lnT nu i uu 1 1 f u 


7QQ 

/oo 


f~> TUVDA 4 AA4 ~T~T£\ 

R-I HYRO1001770 


4 rAA 

1530 


THYROI 001 fiO^ 
iiii nu i uu i ouo 


F-TWYR01 nm Qn*a 

r*lni UU I UUl OUO 


/o4 


R-THYR0 1 001 803 


1531 


Y7QAA1 0000*30 


r- Y / aMMl UUUUoU 


70c 

785 


R-Y79AA1 000030 


1532 


Y7QAA10001 97 

I r 1 UUU 1 C. 1 


r- T / aMAl Uuu l £.( 


786 


R-Y79AA1 0001 27 


1533 


Y7QAA1 00O907 


P V7QA A innnon7 

r-T / y/\A 1 UUUiiU/ 


TO "7 

787 


R-Y79AA1 000207 


1534 


Y7QAA1 nnnoQR 


C V7Q A A 1 nonooe 
r- Y /yAAl UuU^o 


—7 j-\ 
788 


R-Y79AA1 000226 


1535 


Y79AA1 000970 


F-Y7Q A A 1 onnoTn 

r- T / 3nrt 1 UUU^/U 


/oy 


R-Y79AA1 000270 


1536 


Y79AA100049R 


F-Y7Q A A 1 nnn/ioft 

r- T / 3MM I UUU^cD 


7AA 

/yO 


R-Y79AA1 000426 


1537 


Y7QA A 1000^91 

I / v7/A/A 1 UUUD^ 1 


C VTQAAinnncoH 


791 


R-Y79AA1 000521 


1538 


Y7QA A 1 0007^0 

I / J7/"\A> 1 UUU/ OU 


C V7Q am nnn7cn 
r- Y /yAAl UUU/Ou 


792 


R-Y79AA1 000750 


1539 


Y7QAA1 00077R 


r-T /yAA lUuu/ /D 


/93 


R-Y79AA1 000776 


1540 


Y7QAA 1000777 


F- Y7Q A A 1 nnnT77 
i T / yAA 1 UUU / / / 


70 A 

/y4 


D V7r\ A A 4 AAm^^ 

R-Y79AA1 000777 


1541 


Y79AA1000R7R 


F.V7QAA 1 nnnD7c 

r- T / yAA I UUUO /D 


70C 

/95 


R-Y79AA1 000876 


1542 


Y7QAA1 OOORRfl 


F.V7Q A A 1 nnnofiQ 

r - T / yAA 1 UUUoOO 


70C 

/9b 






Y7QAA1000Q^Q 

1 / <Ji\r\ 1 UUU\7\JCJ 


F V7QA A i nnnncn 
it/ yAA i uuuyoy 


19/ 


R-Y79AA 1000959 


1543 


Y7QAA1 OOOQR7 


P V7Q A A i nnnoc7 
i t /yAA luuuyb/ 


798 


R-Y79AA1 000967 


1544 


Y7QAA1001011 

l / <jr\r\ \ uu IU iO 


F Vtqa A 1 nm ni *a 
r-T /yAATUUlu lo 


/99 


JP^ V/^rt A A 41 AA J A J A 

R-Y79AA1 001 01 3 


1545 


Y7QA a 1 nm 


C V7QA A ^ r>rvH Ate 
r-T / yAAl uul 056 


800 


R-Y79AA1001056 


1546 


Y7QAA1 001 OR9 


F V7Q A A 1 nn*t nco 

P- T / yAA 1 UU 1 uO<i 


O AH 
801 


n V/T^rt A A 4 AA J AAA 

R-Y79AA1 001 062 


1547 


Y7QAA1 001 0Q0 


F-Y7QA A 1 nm non 
i t / yAA I uui uyu 


O02 


r~> \/ "7A A A 4 A A 4 AAA 

R-Y79AA 1 001 090 


1548 


Y7QAA1001919 


F Y7Q A A 1 nni Oi O 
r-T /yAAlUU \d. \c. 


O AO 

803 


t~*\ \ / ~~1 S\ A A 4 A A 4 A 4 A 

R-Y79AA1 001 21 2 


1549 


Y79AA10019R4 


F-Y7Q A A 1 nni Oft/1 

i t / yAA 1 uu i cjy\ 


804 


f> V/ "T^S A A 41 AA 4 A A 4 

R-Y79AA1001264 


1550 


Y7QAA1001979 

I / C7/A/A 1 UU 1 § £. 


P V70AAinnHQ70 

r-T /yAAluul^/^ 


one 

805 


R-Y79AA1001272 


1551 


Y7QAA1 001 ^9°. 

I / \j r\r\ | UU 1 O ^_ O 


F.V7QAA 1 nni QOQ 
r- J /yAA I UU I o^o 


806 


r~S \/ ~ > A A A 4 A A 4 AAA 

R-Y79AA1 001 328 


1552 


Y79AA1001426 


F-Y79AA1001426 


807 


R-Y79AA1001426 


1553 

R N»V X-/ 


Y79AA1001427 


F-Y79AA1001427 


808 






Y79AA1001430 


F-Y79AA1001430 


809 


R-Y79AA1001430 


1554 


Y79AA1001523 


F-Y79AA1001523 


810 


R-Y79AA1001523 


1555 


Y79AA1001530 


F-Y79AA1001530 


811 


R-Y79AA1001530 


1556 


Y79AA1001592 


F-Y79AA1001592 


812 


R-Y79AA1001592 


1557 


Y79AA1001727 


F-Y79AA1001727 


813 


R-Y79AA1001727 


1558 


Y79AA1001787 


F-Y79AA1001787 


814 


R-Y79AA1001787 


1559 


Y79AA1001793 


F-Y79AA1001793 


815 
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Table 1 (continued) 



10 



20 



Correspondence between names of clone and the sequence name, and the SEQ ID. 


Name of clone 


Name of 5'-seauence 


SEQ ID 


3 Name of 3'-seauence 


SEQ ID 


Y7QAA1 001 7Q^ 


F-Y7QA A1 001 7QR 


O I D 


R-Y7QAA1 001 7Q^ 


I ODU 


Y7QAA10017QQ 


F-Y7QAA1 001 7QQ 


R1 7 

O I / 


R-Y7QAA1 001 7QQ 


1 ^R1 


Y7QAA1001R0^ 


F-Y7QAA1 001 ROa 

11/ C7rAAA | UU 1 OUO 


R1R 
O I o 


R-Y7QAA1 001 RO^ 
n i t <jr\r\ i uu i ouo 


1 GkR9 


Y7QAA1001Rfi^ 


F-Y7QAA1 001 RfH 

1 I / C7rA/» 1 UU 1 OUO 


R1Q 


R-Y7QAA1 001 RR^ 

111 / J7r\n 1 UU 1 ouo 


1 RRa 
1 ouo 


Y7QAA100P0?? 


F-Y7QAA1 009099 


RPO 


R-Y7QAA1 009099 


1 5R4 

1 JOH 


Y7Q A A 1 ooporr 


F-Y7QAA1 0090RR 


R91 






Y79AA1002121 


F-Y79AA1 002121 


822 


R-nnnnnnnnnnnn 


1565 


Y79AA1002129 


F-Y79AA1 002129 


823 


R-nnnnnnnnnnnn 


1566 


Y79AA1002213 


F-Y79AA1002213 


824 


R-Y79AA1002213 


1567 


Y79AA1 002334 


F-Y79AA1 002334 


825 


R-Y79AA1 002334 


1568 


Y79AA1 002373 


F-Y79AA1 002373 


826 


R-Y79AA1 002373 


1569 


Y79AA1 002376 


F-Y79A A 1002376 


827 


R-Y79AA1 002376 


1570 


Y79AA1 002378 


F-Y79AA1 002378 


828 


R-Y79AA1 002378 


1571 


Y79AA1 002381 


F-Y79AA1 002381 


829 


R-Y79AA1 002381 


1572 


NT2RP2006580 


F-NT2RP2006580 


2545 


R-NT2RP2006580 


2546 



The sequence name starting from "F" means the name of 5'-end sequence, and the sequence name starting from "R" 
means the name of 3'-end sequence. A blank indicates that the 3'-end sequence corresponding to the 5'-end sequence 
25 has not been determined in the clone. 

[0018] Furthermore, the present invention relates to the use of the above primers, as described below. 

(4) A polynucleotide which can be synthesized with the primer set of (2) or (3). 

(5) A polynucleotide comprising a coding region in the polynucleotide of (4). 
30 (6) A substantially pure protein encoded by polynucleotide of (4). 

(7) A partial peptide of the protein of (6). 

[0019] In addition, the present invention comprises a polynucleotide described below and a protein encoded by the 
polynucleotide. 

35 

(8) An isolated polynucleotide selected from the group consisting of 

(a) a polynucleotide comprising a coding region of the nucleotide sequence set forth in any one of the SEQ 
ID NOs in Table 370; 

40 (b) a polynucleotide comprising a nucleotide sequence encoding a protein comprising the amino acid sequence 

set forth in any one of the SEQ ID NOs in Table 370; 

(c) a polynucleotide comprising a nucleotide sequence encoding a protein comprising an amino acid sequence 
selected from the amino acid sequences set forth in the SEQ ID NOs in Table 370, in which one or more amino 
acids are substituted, deleted, inserted, and/or added, wherein said protein is functionally equivalent to the 

45 protein comprising said amino acid sequence selected from the amino acid sequences set forth in the SEQ 

ID NOs in Table 370; 

(d) a polynucleotide that hybridizes with a polynucleotide comprising a nucleotide sequence selected from the 
nucleotide sequences set forth in the SEQ ID NOs in Table 370, and that comprises a nucleotide sequence 
encoding a protein functionally equivalent to the protein encoded by the nucleotide sequence selected from 

50 the nucleotide sequences set forth in the SEQ ID NOs in Table 370; 

(e) a polynucleotide comprising a nucleotide sequence encoding a partial amino acid sequence of a protein 
encoded by the polynucleotide of (a) to (d); 

(f) a polynucleotide comprising a nucleotide sequence with at least 70% identity to the nucleotide sequence 
set forth in any one of the SEQ ID NOs in Table 370. 

55 

(9) . A substantially pure protein encoded by the polynucleotide of (8). 

(10) An antibody against the protein or peptide of any one of (6), (7), and (9). 

(11) A vector comprising the polynucleotide of (5) or (8). 
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(12) A transformant carrying the polynucleotide of (5) or (8), or the vector of (11 ). 

(13) A transformant expressively carrying the polynucleotide of (5) or (8) f or the vector of (11 ). 

(14) A method for producing the protein or peptide of any one of (6), (7), and (9), comprising culturing the trans- 
formant of (13) and recovering the expression product. 

(15) An oligonucleotide comprising; the nucleotide sequence set forth in any one of the SEQ ID NOs in Table 370 
or the nucleotide sequence complementary to the complementary strand thereof, wherein said oligonucleotide 
comprises 1 5 nucleotides or more. 

(16) Use of the oligonucleotide of (15) as a primer for synthesizing a polynucleotide. 

(17) Use of the oligonucleotide of (15) as a probe for detecting a gene. 

(1 8) An antisense polynucleotide against the polynucleotide of (8), or the portion thereof. 

(19) A method for synthesizing a polynucleotide, the method comprising: 

a) synthesizing a complementary strand using a cDNA library as a template, and using the primer set of (2) 
or (3), or the primer of (1 6); and 

b) recovering the synthesized product. 

(20) The method of (19), wherein the cDNA library is obtainable by oligo-capping method. 

(21) The method of (19), wherein the complementary strand is obtainable by PCR. 

(22) A method for detecting the polynucleotide of (8), the method comprising: 

a) incubating a target polynucleotide with the oligonucleotide of (1 5) under the conditions where hybridization 
occurs, and 

b) detecting the hybridization of the target polynucleotide with the oligonucleotide of (15). 

(23) A database of polynucleotides and/or proteins, the database comprising information on at least one sequence 
selected from the nucleotide sequences set forth in the SEQ ID NOs in Table 370 and/or the amino acid sequences 
set forth in the SEQ ID NOs in Table 370, or a medium on which the database is stored. 

[0020] Any patents, patent applications, and publications cited herein are incorporated by reference. 

BRIEF DESCRIPTION OF THE DRAWINGS 

[0021] 

Figure 1 shows the restriction maps of vectors pME18SFL3 and pUC19FL3. 

Figure 2 shows the reproducibility of gene expression analysis. The ordinate and the abscissa show the intensities 
of gene expression obtained in experiments different from each other. 

Figure 3 shows the detection limit in gene expression analysis. The intensity of expression is shown in the ordinate, 
and the concentration (u.g/ml) of the probe used is shown in the abscissa. 

DETAILED DESCRIPTION OF THE INVENTION 

[0022] Herein, "polynucleotide" is defined as a molecule in which multiple nucleotides are polymerized. There are 
no limitations in the number of the polymerized nucleotides. In case that the polymer contains relatively low number 
of nucleotides, it is also described as an "oligonucleotide". The polynucleotide or the oligonucleotide of the present 
invention can be a natural or chemically synthesized product. Alternatively, it can be synthesized using a template DNA 
by an enzymatic reaction such as PCR. 

[0023] All the cDNA provided by the invention are full-length cDNA. Herein, a "full-length cDNA" is defined as acDNA 
which contains both ATG codon (the translation start site) and the stop codon. Accordingly, the untranslated regions, 
which are originally found in the upstream or downstream of the protein coding region in natural mRNA, may or may 
not be contained. 

An "isolated polynucleotide" is a polynucleotide the structure of which is not identical to that of any naturally occurring 
nucleic acid or to that of any fragment of a naturally occurring genomic nucleic acid spanning more than three separate 
genes. The term therefore covers, for example, 

(a) a DNA which has the sequence of part of a naturally occurring genomic DNA molecule but is not flanked by 
both of the coding sequences that flank that part of the molecule in the genome of the organism in which it naturally 
occurs; 
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(b) a nucleic acid incorporated into a vector or into the genomic DNA of a prokaryote or eukaryote in a manner 
such that the resulting molecule is not identical to any naturally occurring vector or genomic DNA; 

(c) a separate molecule such as a cDNA, a genomic fragment, a fragment produced by polymerase chain reaction 
(PCR), or a restriction fragment; and 

5 (d) a recombinant nucleotide sequence that is part of a hybrid gene, i.e., a gene encoding a fusion protein. Spe- 

cifically excluded from this definition are nucleic acids present in mixtures of different (i) DNA molecules, (ii) trans- 
fected cells, or (iii) cell clones: e.g., as these occur in a DNA library such as a cDNA or genomic DNA library. 

[0024] The term "substantially pure" as used herein in reference to a given polypeptide means that the protein or 
10 polypeptide is substantially free from other biological macromolecules. The substantially pure protein or polypeptide 
is at least 75% (e.g., at least 80, 85, 95 ; or 99%) pure by dry weight. Purity can be measured by any appropriate 
standard method, for example, by column chromatography, polyacrylamide gel electrophoresis, or HPLC analysis. 
[0025] All the clones of the present invention (830 clones) are novel and covering full-length, and also predicted to 
encode any of the following functional protein: 

15 

secretory proteins, 
membrane proteins, 

proteins associated to signal transduction (signal transduction-associated proteins; e.g. protein kinases., etc.). 
proteins associated to a glycoprotein (glycoprotein-associated proteins), 
20 proteins associated with transcription (transcription-associated proteins), 

proteins associated with diseases (disease-associated proteins), 

or, enzymes and/or metabolism-associated proteins., cell division- and/or cell proliferation-associated proteins, 
cytoskeleton-associated proteins, nuclear proteins, DNA-and/or RNA-binding proteins, ATP- and/or GTP-binding 
proteins, protein synthesis- and/or protein transport-associated proteins, and cellular defense-associated proteins. 

25 

[0026] Furthermore, all the cDNA clones of the present invention can be characterized as follows: 

(1) a cDNA that is obtained by the oligo-capping method, which provides cDNA with high fullness ratio. The cDNA 
was selected by the score in the ATGpr (described as ATGprt, as well), which is a program for prediction of the 

30 fullness of the 5'-end of cDNA based on the features of the 5'-end sequence. In addition, the PSORT, which is a 

program for prediction of the existence of the signal sequence selected, cDNA that contains a signal sequence in 
the 5'-end, or transmembrane region in the protein coding region. Furthermore, the homology search with the 5'- 
end sequences confirmed that, the selected clones were not identical to any of the known human mRNA (namely 
novel); 

35 or, 

(2) a cDNA that is obtained by the oligo-capping method, which provides cDN A with high fullness ratio. The cDNA 
was selected by the score in the ATGpr, which is a program for prediction of the fullness of the 5'-end based on 
the features of the 5'-end sequence. Furthermore, the a cDNA that has relative homology with an amino acid 
sequence of a protein with known functions was selected by the BLAST search (Altschul S.F., Gish W., Miller W., 

40 Myers E.W., and Lipman DJ. (1990) J. Mol. Biol. 215: 403-410 ; Gish W., and States D.J. (1993) Nature Genet. 

3: 266-272) on the SwissProt database using the 5'-end sequence. In addition, the homology search using the 5*- 
end sequence confirmed that the selected clones were not identical to any of the known human mRNA (namely 
novel). 

45 [0027] All clones are obtainable as a full-length clone by such a method as PCR (Current Protocols in Molecular 
Biology, Ausubel et al. edit, (1987) John Wiley & Sons, Section 6.1-6.4) using both the 5'- and 3'-end sequences, or 
using the 5'-end sequence and an oligo-dT primer that corresponds to the polyA sequence. 

[0028] Specifically, PCR can be performed using an oligonucleotide that has 15 nucleotides longer, and specifically 
hybridizes with the complementary strand of the polynucleotide that contains the nucleotide sequence selected from 

50 the 5'-end sequences shown in Table 1 (SEQ ID NO: 1-829, and SEQ ID NO: 2545), and an oligo-dT primer as a 5'-, 
and 3'-primer, respectively. The length of the primers is usually 15-100 bp, and favorably between 15-35 bp. In case 
of LA PCR, which is described below, the primer length of 25-35 bp may provide a good result. 
[0029] A method to design a primer that enables a specific amplification based on the given nucleotide sequence is 
known to those skilled in the art (Current Protocols in Molecular Biology, Ausubel et al. edit, (1 987) John Wiley & Sons, 

55 Section 6.1-6.4). In designing a primer based on the 5'-end sequence, the primer is designed so as that, in principle, 
the amplification products will include the translation start site. Accordingly, in case that a given 5'-end nucleotide 
sequence is the 5'-untranslated region (5'UTR), any part of the sequence can be used as a S'-primer as far as the 
specificity toward the target cDNA is insured. The translation start site can be predicted using a known method such 
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as the ATGpr as described below. 

[0030] When synthesizing a polynucleotide, the target nucleotide sequence to be amplified can extend to several 
thousand bp in some cDNA. However, it is possible to amplify such a long nucleotides by using such as LA PCR (Long 
and Accurate PCR). It is advantageous to use LA PCR when synthesizing long DNA In LA PCR, in which a special 
DNA polymerase having 3' CB' exonuclease activity is used, misincorporated nucleotides can be removed. Accordingly, 
accurate synthesis of the complementary strand can be achieved even with a long nucleotide sequence. By using LA 
PCR, it is reported that amplification of a nucleotide with 20 kb longer can be achieved under desirable condition 
(Takeshi Hayashi (1996) Jikken-lgaku Bessatsu, "Advanced Technologies in PCR" Youdo-sha). 
[0031] A template DNA for synthesizing the cDNA of the present invention can be obtained by using cDN A libraries 
that are prepared by various methods. The full-length cDNA clones obtained here are those with high fullness ratio, 
which were obtained using a combination of (1) a method to prepare a full-length-enriched cDNA library using the 
oligo-capping method, and (2) an estimation system for fullness using the 5*-end sequence (selection based on the 
estimation by the ATGpr after removing clones that are non-full-length compared to the ESTs). However, it is possible 
to easily obtain a full-length cDNA by using the primers that are provided by the present invention, not by the above 
described specialized method. 

[0032] The problem with the cDNA libraries prepared by the known methods or commercially available is that mRNA 
contained in the libraries has very low fullness ratio. Thus, it is difficult to screen full-length cDNA clone directly from 
the library using ordinary cloning methods. The present invention has revealed a primer that is capable of synthesizing 
a full-length cDNA. If provided with primers, it is possible to synthesize a target full-length cDNA by using enzymatic 
reactions such as PCR. In particular, a full-length-enriched cDNA library, synthesized by methods such as oligo-cap- 
ping, is desirable to synthesize a full-length cDNA with more reliability. 

[0033] Once the nucleotide sequences of the full-length cDNAs obtained in the present invention is determined, it is 
possible to predict the functions of the proteins encoded by the cDNA clones, for example, by searching the databases 
such as GenBank (http://www.ncbi.nlm.nih.gov/web/GenBankO, Swiss-Prot (http://www.ebi.ac.uk/ebi_docsSwiss- 
Prot_db/swisshome.html), UniGene (http://www.ncbi.nlm.nih.gov/UniGene) for homologies of the cDNAs, or by search- 
ing the amino acid sequences deduced from the full-length nucleotide sequences for signal sequence by using software 
such as PSORT (K. Nakai & M. Kanehisa, Genomics, 14: 897-991 (1992), for transmembrane region by using software 
such as SOSUI (T. Hirokawa et al., Bioinformatics, 14:378-379 (1998); Mitsui Knowledge Industry Co., Ltd.) or for motif 
by using software such as Pfam (http://www.sanger.ac.uk^ofhware/Pfam/index.shtml) or PROSITE (http://www.ex- 
pasy.ch/prosite). As a matter of course, the functions are often predictable by using partial sequence information (pref- 
erably 300 nucleotides or more) instead of the full-length nucleotide sequences. However, the result of the prediction 
obtained by using partial sequence information does not always agree with the result obtained by using full-length 
nucleotide sequence, and thus it is needless to say that the prediction of function is preferably performed based on 
the full-length nucleotide sequences. 

[0034] Homology search using each of GenBank, Swiss-Prot and UniGene was performed for the 826 clones whose 
full-length nucleotide sequences had been determined (H EM BA1 005337, NT2RM1000407, NT2RM2001 767, and 
NT2RP3003939 are not full-length). The amino acid sequences deduced from the full-length nucleotide sequences 
were searched for functional domains by using analytical software programs ; PSORT, SOSUI and Pfam. Based on the 
results, proteins encoded by the cDNA clones were grouped into some categories and their functions were predicted. 
[0035] The following 437 clones were categorized into secretory and/or membrane proteins. The clones categorized 
into secretory and/or membrane proteins are those which matched the full-length sequences of Swiss-Prot database 
with the keywords "growth factor, "cytokine", "hormone", "signal", "transmembrane", "membrane", "extracellular ma- 
trix", "receptor", "G-protein coupled receptor", "ionic channel", "voltage-gated channel", "calcium channel", "cell adhe- 
sion", "collagen" or "connective tissue"; those which matched the data, suggesting that the proteins are secretory and/ 
or membrane proteins; or those which matched with the full-length sequences of GenBank or UniGene database similar 
description; and, further, those predicted to have an N-terminal signal sequence or a transmembrane region as a result 
of domain search for the amino acid sequences deduced from the full-length nucleotide sequences. 
BNGH41000020, BNGH41000087. BNGH41000091 , HEMBA1000121 , HEMBA1000128, HEMBA1000349 



H EM BA 1000477, 
HEMBA1 000940, 
HEMBA1002163, 
HEMBA1 002421, 
HEMBA 1003530, 
HEMBA1 004391, 
HEMBA1 005070, 
HEMBA1006171, 
HEMBA1 006482, 
HEMBA1 007241, 



HEMBA1 000590 
HEMBA1 000962 
HEMBA1002167 
HEMBA1 002767 
HEMBA1 003602 
H EM BA1 004444 
HEMBA1 005449 
HEMBA1 006299 
HEMBA1 006707 
HEMBB1 000407 



HEMBA1000713, 
HEMBA1001221, 
HEMBA1002178, 
HEMBA1 003047, 
HEMBA1 003732, 
HEMBA1 004454, 
HEMBA1 005522, 
HEMBA1 006311, 
HEMBA 1006724, 
H EM BB1 000447, 



HEMBA1000732, 
HEMBA1001228, 
HEMBA1 002195, 
HEMBA1003101, 
HEMBA1 003945, 
HEMBA 1004505, 
HEMBA1 005545, 
HEMBA1 006335, 
HEMBA1 006749, 
HEMBB1 000567, 



HEMBA1 000745, 
HEMBA1001621, 
HEMBA1 002227, 
HEMBA1 003230, 
HEMBA1004110, 
HEMBA 1004797, 
HEMBA1 005698, 
HEMBA1 006357, 
HEMBA1 006902, 
HEMBB1 000679, 



HEMBA1 000835, 
HEMBA1002131, 
HEMBA1 002420, 
HEMBA1 003392, 
HEMBA1 004250, 
HEMBA1 004982, 
HEMBA1 005945, 
HEMBA1 006430, 
HEMBA1 006960, 
HEMBB1 000881, 
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HEMBB1001026, HEMBB1001048 5 HEMBB1 001407, HEMBB1001530, HEMBB1001573, HEMBB1001847, 

HEMBB1001978, HEMBB1002041 , HEMBB1002162, HEMBB1002245, HEMBB1002427, HEMBB1002693, 

MAMMA1000102, MAMMA1000106, MAMMA1000118, MAMMA1000141 , MAMMA1 000204, MAM MA1 000226, 

MAMMA1 000457, MAMMA1 000473, MAMMA1 000496, MAMMA1 000591 , MAMMA1 000681 , MAMMA1 000810, 
MAMMA1 000986, MAMMA1 000994, MAMMA1001043, MAMMA1001141 , MAMMA1001237, MAMMA1001344, 

MAMMA1001418, MAMMA1001893, MAMMA1001957, MAMMA1001978, 
MAMMA1 002070, MAMMA1 002091 , MAMMA1 002095, MAMMA1002165, MAMMA1 002234 



MAMMA1 002633, MAMMA1 0031 26, 



NT2RM1 000858 

10 NT2RM2001626 
NT2RM4000417 
NT2RM4000997 
NT2RP1 000050 
NT2RP1 000551 

15 NT2RP2000533 
NT2RP2000903 
NT2RP2001755 
NT2RP2002527 
NT2RP2003042 

20 NT2RP2003655 
NT2RP2004524 
NT2RP2005163 
NT2RP2005666 
NT2RP2006042 

25 NT2RP3000304 
NT2RP3000818 
NT2RP3001240 
NT2RP3001650 
NT2RP3002342 

30 NT2RP3002958 
NT2RP3003614 
NT2RP3004309 
NT2RP3004647 
NT2RP4002451 

35 OVARC1000313 
OVARC 1000956 
OVARC1001725 
PLACE1000912 
PLACE1001231 

40 PLACE1001536 
PLACE1001836 
PLACE1 002374 
PLACE1 002967 
PLACE1 003845 

45 PLACE1 004482 
PLACE1 005003 
PLACE1 005544 
PLACE1 005927 
PLACE1 006443 

50 PLACE1 007296 
PLACE1008716 
PLACE1 009527 
PLACE1010251 
PLACE1011116, 

55 PLACE3000213, 
THYROI 000099. 



NT2RM1 000462, 
NT2RM2000241 , 
NT2RM2001939, 
NT2RM4000587, 
NT2RM4001735, 
NT2RP1 000261, 
NT2RP1 000981, 
NT2RP2000649, 
NT2RP2001276, 
NT2RP2001956, 
NT2RP2002721 , 
NT2RP2003383, 
NT2RP2003950, 
NT2RP2004606, 
NT2RP2005378, 
NT2RP2005883, NT2RP2005994, NT2RP2006004, 

NT2RP2006269, NT2RP1 00651 2, NT2RP2006580, 

NT2RP3000460, 
NT2RP3000921 , 
NT2RP3001322, 
NT2RP3001858, 
NT2RP3002737, 
NT2RP3003076, 
NT2RP3004075, 
NT2RP3004406, 
NT2RP4001009, 
OVARC1 000003, 
OVARC 1000439, 
OVARC1001163, 
PLACE1 000033, 
PLACE1 000927, 
PLACE1001401, 
PLACE1001564, PLACE 1001 655, PLACE1001 795, 

PLACE1 001 91 8, PLACE1 001 949, PLACE1 002080, 

PLACE1 002547, 
PLACE 1003573, 
PLACE1 004279, 
PLACE1 004630, 
PLACE1 005031, 
PLACE1 005660, 
PLACE1 006093, 
PLACE 1006809, 
PLACE1 007845, 
PLACE 1008985, 
PLACE1 009600, 
PLACE1010713, 
PLACE1011236 : 
SKNMC1 000004, 
THYROI 000400, 



NT2RM 1000899, 
NT2RM2001792, 
NT2RM4000444, 
NT2RM4001325, 
NT2RP1000181, 
NT2RP1000613, 
NT2RP2000616, 
NT2RP2001200, 
NT2RP2001915, 
NT2RP2002533, 
NT2RP2003210, 
NT2RP2003664, 
NT2RP2004556, 
NT2RP2005181, 



NT2RP3000436, 

NT2RP3000907, 

NT2RP3001271, 

NT2RP3001738, 

NT2RP3002411, 

NT2RP3003000, 

NT2RP3004025, 

NT2RP3004345, 

NT2RP4001001, 

NT2RP4002750, 

OVARC1 000410, 

OVARC1001030, 

OVARC1001991, 

PLACE1000914, 

PLACE1001340, 



PLACE1002518, 
PLACE 1003407, 
PLACE1 003852, 
PLACE1 004520, 
PLACE1 005005, 
PLACE1 005569, 
PLACE1 006071, 
PLACE1006716, 
PLACE1 007626, 
PLACE1 008744, 
PLACE1 009546, 
PLACE1010445, 
PLACE1011181, 
PLACE4000354, 
THYRO1000196, 



NT2RM1 000542, 
NT2RM2000410, 
NT2RM2001941, 
NT2RM4000593, 
NT2RM4001768, 
NT2RP1 000300, 
NT2RP1 001004, 
NT2RP2000663, 
NT2RP2001480, 
NT2RP2002063, 
NT2RP2002824, 
NT2RP2003390, 
NT2RP2004179, 
NT2RP2004648, 
NT2RP2005463, 



NT2RP3000616, 
NT2RP3001012, 
NT2RP3001388, 
NT2RP3002015, 
NT2RP3002790, 
NT2RP3003354, 
NT2RP3004083, 
NT2RP3004481, 
NT2RP4001467, 
OVARC1000105, 
OVARC1 000553, 
OVARC1001336, 
PLACE 1000231, 
PLACE1001016, 
PLACE1001407, 



PLACE1 002726, 
PLACE 1003737, 
PLACE 1004282, 
PLACE 1004657, 
PLACE 1005383, 
PLACE 1005725, 
PLACE 1006208, 
PLACE 1006959, 
PLACE 1007881, 
PLACE 1009067, 
PLACE1 009982, 
PLACE1010784, 
PLACE1011516, 
SKNMC1000014, 
THYRO1000584, 



NT2RM1 000580 
NT2RM2000423 
NT2RM4000198 
NT2RM4000648 
NT2RM4001843 
NT2RP1 000325 
NT2RP1001563 
NT2RP2000694 
NT2RP2001495 
NT2RP2002188 
NT2RP2002942 
NT2RP2003469 
NT2RP2004205 
NT2RP2004794 
NT2RP2005541 

NT2RP3000169 
NT2RP3000676 
NT2RP3001159 
NT2RP3001560 
NT2RP3002160 
NT2RP3002836 
NT2RP3003532 
NT2RP3004130 
NT2RP3004552 
NT2RP4001879 
OVARC1 000298 
OVARC1 000811 
OVARC1 001570 
PLACE1 000560 
PLACE1001123 
PLACE1001464 

PLACE1 002095 
PLACE1 002905 
PLACE 1003772 
PLACE 1004441 
PLACE 1004648 
PLACE 1005410 
PLACE 1005745 
PLACE 1006277 
PLACE 1007081 
PLACE1 008359 
PLACE1009196 
PLACE1010011 
PLACE1010827 
PLACE1011708 
SKNMC1 000082 
THYR01 000678 



MAMMA1 002586 
NT2RM1 000855 
NT2RM2000565 
NT2RM4000284 
NT2RM4000761 
NT2RM4002352 
NT2RP1 000448 
NT2RP2000479 
NT2RP2000818 
NT2RP2001514 
NT2RP2002232 
NT2RP2002976 
NT2RP2003593 
NT2RP2004495 
NT2RP2005027 
NT2RP2005597 

NT2RP3000171 
NT2RP3000721 
NT2RP3001195 
NT2RP3001592 
NT2RP3002311 
NT2RP3002900 
NT2RP3003535 
NT2RP3004133 
NT2RP3004625 
NT2RP4002187 
OVARC 1000307 
OVARC1 000873 
OVARC1001607 
PLACE 1000740 
PLACE1001183 
PLACE1001516 

PLACE1 002355 
PLACE1 002911 
PLACE1 003839 
PLACE1 004450 
PLACE1004816 
PLACE1 005426 
PLACE1 005878 
PLACE1 006290 
PLACE1 007096 
PLACE1 008469 
PLACE1 009279 
PLACE1010078 
PLACE1010968 
PLACE3000181 
THYROI 000036 
THYP01 000776 



THYRO1000795, THYRO1000956, THYRO1001102, THYRO1001113, 
THYRO1001205, THYRO1001237, THYRO1001242, THYRO1001266, 



THYR0 1 001 327, THYROI 001 456, 
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THYR01 001 478, THYR01 001 523, 
Y79AA1 000207, Y79AA1 000226, 



Y79AA1 000959, 
Y79AA1001427, 
Y79AA 1001 803, 



THYRO1001529, 
Y79AA1 000270, 
Y79AA1 001013, 
Y79AA1001430, 
Y79AA1 002022, 



THYRO1001641, 
Y79AA1 000426, 
Y79AA1001212, 
Y79AA1001727, 
Y79AA1 002058, 



THYRO1001702, 
Y79AA 1000521, 
Y79AA1001264, 
Y79AA1001787, 
Y79AA1 002129. 



THYRO1001725, 
Y79AA1 000876, 
Y79AA1001328, 
Y79AA1001795, 
Y79AA1002213, 



Y79AA1 000888, 
Y79AA1001426, 
Y79AA1001799, 
Y79AA1 002373, 

[0036] The following 1 46 clones were categorized into glycoprotein-associated proteins. The clones categorized into 
glycoprotein-associated proteins are those which matched the full-length sequences of Swiss-Prot database with the 
keyword "glycoprotein"; those which matched the data suggesting that the proteins are glycoprotein; or those which 
matched the full-length sequences of GenBank or UniGene database with similar description. 
BNGH41000087, BNGH41000091 , HEMBA1000349, HEMBA1000590, HEMBA1000745, 

HEMBA1 001621, HEMBA1002131 , HEMBA1002178! 
HEMBA1003392, HEMBA1004250, HEMBA1004391, 
HEMBA1005522, HEMBA1005545, HEMBA1006707,' 
HEMBB1000881, HEMBB1001048, HEMBB1002120, 



HEMBA1 001221, HEMBA1001228, 

HEMBA1 002767, HEMBA1 003230, 

HEMBA1 004505, HEMBA1005449, 

HEMBA1 006902, HEMBB1 000679 



HEMBA1 000835, 
H EM BA 1002421, 
HEMBA1 004444, 
HEMBA1 006749, 
HEMBB1 002245, 



HEMBB1002427, MAMMA1000102, MAMMA1000591, MAMMA1000681 [ MAMMA1001043' MAMMA1001237 

MA AAA J A H AAOATA tlAtJllij AAAr - nn ......... . _ - . ' 1 



MAMMA1 002070, MAMMA1 002586 
NT2RM2001 81 8, NT2RM2001 939, 



NT2RM4000648, 
NT2RP1 001004, 
NT2RP2002533, 
NT2RP2005181, 
NT2RP2006269, 
NT2RP3002737, 
NT2RP3004481 , 
OVARC1 000003, 



NT2RM4000997, 

NT2RP2000616, 

NT2RP2003042, 

NT2RP2005541, 

NT2RP3000304, 

NT2RP3002958, 

NT2RP3004552, 

OVARC1 000553, 



MAMMA1003126, NT2RM 1000462. NT2RM1000580, 
NT2RM2001 941 , NT2RM40001 98, NT2RM4000284, 



NT2RM4001325, 
NT2RP2000694, 
NT2RP2003210, 
NT2RP2005597, 
NT2RP3000616, 
NT2RP3003000, 
NT2RP3004640, 
OVARC1 000811, 



NT2RM4002352, 
NT2RP2000903, 
NT2RP2004205, 
NT2RP2005883, 
NT2RP3000921, 
NT2RP3003532, 
NT2RP4000108, 
OVARC1 000873, 



OVARC1001991, PLACE1000033, PLACE 1000740, PLACE1001016, 



PLACE1001123. 
PLACE1 002374, 
PLACE1 003839, 
PLACE1 005003, 
PLACE1 007845, 
PLACE1010968, 
THYRO1000196 



PLACE1001231, 
PLACE1 002905, 
PLACE1 004282, 
PLACE1 005426, 
PLACE1008716, 
PLACE 1011 116, 
THYROI 000584, 



PLACE1001464, 
PLACE1002911, 
PLACE1 004441, 
PLACE1 006071, 
PLACE1 008744, 
PLACE3000181, 
THYR01 000956, 



PLACE1 001655, 
PLACE1 003573, 
PLACE 1004450, 
PLACE 1006073, 
PLACE 1008985, 
PUVCE3000213, 
THYRO1001266, 



NT2RP1000613, 
NT2RP2001480, 
NT2RP2004606, 
NT2RP2006004, 
NT2RP3001650, 
NT2RP3004130, 
NT2RP4001467, 
OVARC1001336, 

PLACE1001836, 
PLACE1 003737, 
PLACE1 004520, 
PLACE1 006290, 
PLACE1010251, 
PLACE4000354, 
Y79AA1 000270, 



NT2RM2001792, 
NT2RM4000417, 
NT2RP1 000981, 
NT2RP2001755, 
NT2RP2005027, 
NT2RP2006042, 
NT2RP3002160, 
NT2RP3004133, 
NT2RP4002750, 
OVARC1001607, 

PLACE1 002355, 
PLACE1 003772, 
PLACE1 004648, 
PLACE1 007081, 
PLACE1010784, 
THYROI 000036, 
Y79AA1 000426, 



Y79AA1001727, Y79AA1001795, Y79AA1002022, Y79AA1002213, 
[0037] The following 57 clones were categorized into signal transduction-associated proteins. The clones categorized 
into signal transduction-associated proteins are those which matched the full-length sequences of Swiss-Prot database 
with the keywords "serine/threonine-protein kinase", "tyros ine-prote in kinase" or "SH3 domain"; those which matched 
the data suggesting that the proteins are signal transduction-associated proteins (for example, "ADP-ribosylation fac- 
tor"); or those which matched the full-length sequences of GenBank or UniGene database with similar description; 
and, further those which was similarly predicted to be signal transduction-associated proteins based on the matchinq 
data of Pfam. 

HEMBA1000006, HEMBA1002195, HEMBA1002227, HEMBA1002551, HEMBA1005084, HEMBA1005929 
HEMBA1006658, HEMBA1006916, MAMMA1000881 , MAMMA1001150, MAMMA1001310 MAMMA1002142' 
NT2RM2001902, NT2RP1001020, NT2RP1001 031 , NT2RP2001469, NT2RP2001529 NT2RP2001769' 



NT2RP2003179, 
NT2RP3000201, 
NT2RP4000634, 
OVARC 1001 569, 
PLACE1 008282, 
THYRO1001593. 



NT2RP2003545, 
NT2RP3000820, 
NT2RP4000962, 
PLACE1 002329, 
PLACE1 008297, 
THYRO1001700, 



NT2RP2004670, 
NT2RP3003527, 
OVARC 1000255, 
PLACE 10031 35, 
PLACE1010081, 
THYRO1001770, 



NT2RP3000011, 
NT2RP3003849, 
OVARC1 000529, 
PLACE 1003598, 
PLACE1011364, 
Y79AA1 000777, 



NT2RP3000022, 
NT2RP3003874, 
OVARC1 00091 6 : 
PLACE1005519, 
PLACE1011824, 
Y79AA1 000967, 



NT2RP3000172, 
NT2RP3004067, 
OVARC1001338, 
PLACE1 006208, 
THYRO1001457, 
Y79AA1 002376, 



Y79AA1 002381, HEMBB1 000668, NT2RM4001377 
[0038] The following 81 clones were categorized into transcription-associated proteins. The clones categorized into 
transcription-associated proteins are those which keywords "transcription regulation", "zinc finger" or "homeobox" 
matched the full-length sequences of Swiss-Prot database; those which the matched the data suggesting that the 
proteins were transcription-associated proteins: or those which matched the full-length sequences of GenBank or Uni- 
Gene database with similar description; and, further, those which was similarly predicted to be transcription- associated 
proteins based on the matching data of Pfam. 
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HEMBA1000462 r HEMBA1000671 , HEMBA1001297, HEMBA1001390, HEMBA1001886, HEMBA1002048, 
HEMBA1003120, HEMBA1003497, HEMBA1004785, HEMBA1005230, HEMBA1005246, HEMBA1006276, 
HEMBA1006572, HEMBA1007226, HEMBB1000106, HEMBB1000905, HEMBB1001959, HEMBB1002051 , 
HEMBB1002661, MAMMA1001 094. MAMMA1001532, MAMMA1001615, NT2RM1000789, NT2RM2000632, 

5 NT2RM2000773, NT2RM4000326, NT2RP1000271 , NT2RP1 000468, NT2RP2000092, NT2RP2000610, 
NT2RP2000712, NT2RP2000739, NT2RP2001538, NT2RP2001662, NT2RP2001817, NT2RP2001948, 
NT2RP2002564, NT2RP2002974, NT2RP2003138, NT2RP2003302, NT2RP2003940, NT2RP2004108, 
NT2RP2004847, NT2RP2005247, NT2RP2005391 , NT2RP2005535, NT2RP2005774, NT2RP2005941 , 
NT2RP2006092, NT2RP3000148, NT2RP3000232, NT2RP3000378, NT2RP3000652, NT2RP3001976, 

10 NT2RP3004090, NT2RP3004119, NT2RP3004294, OVARC1001049, OVARC1 001086, OVARC1001132, 
OVARC1001807, PLACE1000258, PLACE1000442, PLACE1000907, PLACE1003529, PLACE10041 66, 
PLACE1004168, PLACE1004887, PLACE1005250, PLACE1005682, PLACE1006079, PLACE1008549, 
PLACE1011407, PU\CE1011978, THYR01 000580, Y79AA1 000030, Y79AA1001090, Y79AA1001 523, 
Y79AA1 002334, Y79AA1 002378, HEMBB1 002302, 

15 [0039] The following 85 clones were categorized into disease-associated proteins. The clones categorized into dis- 
ease-associated proteins are those which matched the full-length sequences of Swiss-Prot database with the keywords 
"disease mutation" or "syndrome"; those which matched the data suggesting that the proteins are disease-associated 
proteins; or those which matchedthe full-length sequences of Swiss-Prot database and GenBank or UniGene database 
where the matched sequences are those of genes or proteins which had been deposited in the database of Online 

20 Mendelian Inheritance in Man (OMIM) (httpV/www. ncbi.nlm.nih.gov/Omim/), which is a database of human genes and 
diseases. 

BNGH41000020, HEMBA1000349, HEMBA1000590, HEMBA1000671 , HEMBA1 000835, HEMBA1001184, 
HEMBA1 001228, HEMBA1001886, HEMBA1003120, HEMBA1004250, H EM BA1 005246, HEMBA1005267, 
HEMBA1006707, HEMBA1006749 : HEMBA1006902, HEMBA1006916, HEMBA1007013, HEMBB1002120, 

25 MAMMA1000204, MAMMA1002080, NT2RM2000632, NT2RM2001126, NT2RM2001558, NT2RP1000271 , 
NT2RP1 000465, NT2RP1 000579, NT2RP2000447, NT2RP2000514, NT2RP2000739, NT2RP2001223, 
NT2RP2001529, NT2RP2001562, NT2RP2002674, NT2RP2003369, NT2RP2004108, NT2RP2004205, 
NT2RP2005535, NT2RP2005941 , NT2RP2006004, NT2RP3000059, NT2RP3000125, NT2RP3000201 , 
NT2RP3000232, NT2RP3000616, NT2RP3000677, NT2RP3000838, NT2RP3000921 , NT2RP3001542, 

30 NT2RP3002286, NT2RP3002721 , NT2RP3002737, NT2RP3002738, NT2RP3004481 , OVARC1000208, 
OVARC1000275, OVARC1000331 , OVARC1000410, OVARC1001086, OVARC1001132, OVARC1001 607, 
OVARC1001725, OVARC1001952, PLACE1000258, PLACE1000442, PLACE1000907, PLACE1001100, 
PLACE1001500, PLACE1 002905, PLACE1002967, PLACE1003407, PLACE1003428, PLACE1005005, 
PLACE1 005239, PLACE1005815, PLACE 1007028, PLACE1008716, PLACE1011407, PLACE1011978, 

35 PLACE2000118 ; THYPO1000580, THYRO1000866, THYRO1001 071 , THYRO1001478, Y79AA1001 062, 
Y79AA1001530, 

[0040] It is unclear, by the analyses so far, whether or not the remaining 212 clones encode proteins belonging to 
any of the categories of secretory and/or membrane proteins, glycoprotein-associated proteins, signal transduction 
-associated proteins, transcription-associated proteins or disease-associated proteins. Nonetheless, it is still possible 
40 for these clones to encode secretory and/or membrane proteins, glycoprotein-associated proteins, signal transduction- 
associated proteins, transcription-associated proteins, or disease-associated proteins. On the other hand, some of 
these clones can be presumed to have functions other than those as secretory and/or membrane proteins, glycoprotein- 
associated proteins, signal transduction-associated proteins, transcription-associated proteins and disease-associated 
proteins. 

45 [0041] Among the 212 clones, the following clones presumably belong to the categories of enzymes and/or metab- 
olism-associated proteins, cell division- and/or cell proliferation-associated proteins, cytoskeleton-associated proteins, 
nuclear proteins, DNA- and/or RNA-binding proteins, ATP-and/or GTP-binding proteins, protein synthesis- and/or pro- 
tein transport-associated proteins, or cellular defense-associated proteins, although it is unclear whether or not the 
clones belong to any of the categories of secretory and/or transmembrane proteins, glycoprotein-associated proteins, 

50 signal transduction-associated proteins, transcription-associated proteins, and disease-associated proteins. 

[0042] The following 1 0 clones presumably belong to the category of enzymes and/or metabolism-associated pro- 
teins. The clones herein defined as clones presumably belonging to the category of enzymes and/or metabolism- 
associated proteins matched data containing keywords such as "metabolism", "oxidoreductase" and "E.C. No. (Enzyme 
commission number)". 

55 HEMBA1003315, HEMBB1002465, MAMMA1000614, NT2RP2000178, NT2RP2001388, NT2RP2001903, 
NT2RP2002304, NT2RP2005878, NT2RP3001685, PLACE1006219 

[0043] The following 4 clones presumably belong to the category of cell division- and/or cell proliferation-associated 
proteins. The cDNA clones were herein defined as clones presumably belonging to the category of cell division- and/ 
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or cell proliferation-associated proteins matched data containing keywords such as "cell division", "cell cycle", "mitosis", 
"chromosomal protein", "cell growth" and "apoptosis". 
MAMMA1000403, NT2RM2000497, NT2RP2000394, Y79AA1 002121 

[0044] The following 6 clones presumably belong to the category of cytoskeleton-associated proteins. The cDNA 
clones were herein defined as clones presumably belonging to the category of cytoskeleton-associated proteins 
matched data containing keywords such as "structural protein", "cytoskeleton", "actin-binding" and "microtubules". 
MAMMA 1001609, NT2RM2000589, NT2RP3000063, PLACE 1004078 : PLACE 1004492, PLACE 1008657 
[0045] The following 7 clones presumably belong to the category of nuclear proteins. The cDNA clones were herein 
defined as clones presumably belonging to the category of nuclear proteins matched data containing keywords such 
as "nuclear protein". 

HEMBA1001878, HEMBA1002992, MAMMA1000614. NT2RM4000965. NT2RM2001738 NT2RP2001388 
Y79AA1002121 

[0046] The following 5 clones presumably belong to the category of DNA- and/or RNA-binding proteins. The cDNA 
clones were herein defined as clones presumably belonging to the category of DNA- and/or RNA-binding proteins 
matched data containing keywords such as "DNA-binding" and "RNA-binding". 
HEMBA1003072, HEMBA1006770, HEMBA1007332, NT2RM2000497, Y79AA1002121 

[0047] The following 7 clones presumably belong to the category of ATP- and/or GTP-binding proteins. The cDNA 
clones were herein defined as clones presumably belonging to the category of ATP- and/or GTP-binding proteins 
matched data containing keywords such as "ATP-binding M and "GTP-binding". 

HEMBA1002316, MAMMA1001609, NT2RM2000306, NT2RM2000497, NT2RP2000178, NT2RP3003729 
PLACE1 004305 

[0048] The following 7 clones presumably belong to the category of protein synthesis- and/or protein transport-as- 
sociated proteins. The cDNA clones were herein defined as clones presumably belonging to the category of protein 
synthesis-associated and/or protein transport-associated proteins matched data containing keywords such as "trans- 
lation regulation", "protein biosynthesis", "amino-acid biosynthesis", "ribosomal protein", "protein transport" and "signal 
recognition particle". 

NT2RM4000965, NT2RP2005069, NT2RP3000481, NT2RP3000789. NT2RP4001877, OVARC1001833 
OVARC1 002058, 

[0049] The following 1 clone presumably belongs to the category of cellular defense-associated proteins. The cDNA 
clones were herein defined as clones presumably belonging to the category of cellular defense-associated proteins 
matched data containing keywords such as "heat shock". "DNA repair" and "DNA damage". 
PLACE 1005539 

[0050] Although it is unclear whether or not 26 out of 1 74 clones other than the above-mentioned clones belong to 
any of the above-described categories, these clones are predicted to have some functions, based on the homology 
search using their full-length sequences thereof. The clone names and the gene definitions found in the result of ho- 
mology search are shown below, separated with a double-slash mark, //. 

HEMBA1000634//Homo sapiens T-cell activation protein (PGR1) gene, complete cds. 
HEMBA1002524//Human MHC Class I region proline rich protein mRNA. complete cds. 
HEMBA1003399//MVP1 PROTEIN. 

■ 

HEMBA1005489//Mus musculus semaphorin cytoplasmic domain-associated protein 3A (Semcap3) mRNA, com- 
plete cds. 

HEMBB1000542//Mus musculus bromodomain-containing protein BP75 mRNA, complete cds. 
MAMMA1 000788//Bos taurus P14 (p14) mRNA, complete cds. 
MAMMA1002128//ABC1 PROTEIN HOMOLOG PRECURSOR. 
NT2RM2000514//Homo Sapiens F-box protein Fbx21 (FBX21) mRNA, complete cds. 
NT2RM2000622//Mus musculus F-box protein FBL1 0 mRNA, partial cds. 
NT2RM4000100//Homo sapiens Leman coiled-coil protein (LCCP) mRNA, complete cds. 
NT2RP2005425//Homo sapiens mRNA for AKAP450 protein. 

NT2RP3001170//Mus musculus activity-dependent neuroprotective protein (Adnp) mRNA, complete cds. 

NT2RP3002571//Bos taurus mRNA for lyncein. 

NT2RP3004557//Human Ki nuclear autoantigen mRNA, complete cds. 

OVARC1001596//Homo sapiens Arf-like 2 binding protein BART1 mRNA, complete cds. 

PLACE1002153//Homo sapiens TACC2 protein (TACC2) mRNA, partial cds. 

PLACE 1 0031 63//Homo sapiens DBI-related protein mRNA, complete cds. 

PLACE1005736//Human mRNA for BAS-GRIP protein. 

PLACE1007702//Mus musculus TRA1 mRNA, complete cds. 

PLACE1011045//Homo sapiens E1-like protein mRNA, complete cds. 
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THYRO1000061//Mus musculus mRNA for UBE-1c1 } UBE-1c2, UBE-1c3, complete cds. 
THYRO1000964//Drosophila melanogaster Pelle associated protein Pellino (Pli) mRNA, complete cds. 
Y79AA1000776//Mus musculus mRNA for GSG1 , complete cds. 
Y79AA1001056//Homo sapiens MAID protein mRNA, complete cds. 
5 Y79AA1001272//Homo sapiens retinoic acid repressible protein (RARG-1) mRNA, complete cds. 

Y79AA1001793//Mus musculus mRNA for GSG1 , complete cds. 

[0051] So far, useful information for presuming the functions are unavailable for the remaining 148 clones, whose 
names are listed below. 

10 HEMBA1 000275, HEMBA1000300, H EM BA1 000443, HEMBA1000875, HEMBA1000907, HEMBA1001272, 
HEMBA1001296, HEMBA1001563, HEMBA1002164, HEMBA1002239, HEMBA1002985, HEMBA1003294, 
HEMBA1003487, HEMBA1 004007, HEMBA1004067, HEMBA1004085, HEMBA1004952, HEMBA1004971 , 
HEMBA1005145, HEMBA1005430, HEMBA1005913, HEMBA1006016, HEMBA1006517, HEMBA1006544, 
HEMBA1006912, HEMBA1007057, HEMBA1 007063, HEMBA1007291 , HEMBB1000276, HEMBB1000309, 

15 HEMBB1000642, HEMBB1001200, HEMBB1 001547, HEMBB1002039, HEMBB1002228, HEMBB1002663, 
MAMMA1 000046, MAMMA1 000449, MAMMA1 000528, MAMMA1000652, MAMMA1 000706. MAMMA1 000814, 
MAMMA1001066, MAMMA1001284, MAMMA1001623, MAMMA1001634, MAMMA1001901, MAMMA1002087, 
MAMMA1 002205, MAM MA 1002224, NT2RM2000582, NT2RM2001 643, NT2RM4000115, NT2RM4000295, 
NT2RM4001321, NT2RP1 000002, NT2RP1 000239, NT2RP1 000679, NT2RP1 000740, NT2RP1 000903, 

20 NT2RP2000240, NT2RP2001878, NT2RP2001921 , NT2RP2002015, NT2RP2002409, NT2RP2002510, 
NT2RP2003599, NT2RP2003931 , NT2RP2004069, NT2RP2004141 , NT2RP2004447, NT2RP2004837, 
NT2RP2005514, NT2RP2005632, NT2RP2005887, NT2RP2006099, NT2RP2006134, NT2RP3000427, 
NT2RP3000444, NT2RP3000645, NT2RP3000871 , NT2RP3001044, NT2RP3001061, NT2RP3001754, 
NT2RP3002281, NT2RP3002324, NT2RP3002353, NT2RP3002409, NT2RP3002448, NT2RP3002664, 

25 NT2RP3002887, NT2RP3002983, NT2RP3003448, NT2RP3003469, NT2RP3003473, NT2RP3003559, 
NT2RP3003963, NT2RP3004000, NT2RP3004202, NT2RP3004321 , 

NT2RP3004355, NT2RP3004374, NT2RP4002715, OVARC1 000090, OVARC1000137, OVARC1 000467, 
OVARC1000775, OVARC1000853, OVARC1000995, OVARC1 001222, OVARC1001260, OVARC1001727, 
OVARC1002178, PLACE1 000986, PLACE1001114, PLACE1 001229, PLACE1001788, PLACE1 003438, 

30 PLACE1 003460, PLACE 1003644, PLACE1 004028, PLACE1004199, PLACE1004519, PLACE1 005601 , 
PLACE1005669, PLACE1005768, PLACE1006515, PLACE1006786, PLACE1007040, PLACE1007077, 
PLACE1 007591, PLACE1 007971 , PLACE1 008984, PLACE1 009735, PLACE2000219, PLACE4000455, 
THYRO1000846, THYRO1000999, THYRO1001063, THYRO1001128, THYRO1001471, THYRO1001495, 
THYRO1001608, THYRO1001803, Y79AA1000127, Y79AA1 000750, Y79AA1001592, Y79AA1001863, 

35 [0052] In the 437 clones categorized into secretory and/or membrane proteins by using their full-length sequences, 
410 clones were also predicted to encode proteins having functions of secretory and/or membrane proteins by using 
their partial nucleotide sequences. In the 146 clones categorized into glycoprotein-associated proteins by using their 
full-length sequences, 124 clones were also predicted to encode proteins having functions of glycoprotein-associated 
proteins by using their partial nucleotide sequences. In the 57 clones categorized into signal transduction-associated 

40 proteins by using their full-length sequences, 46 clones were also predicted to encode proteins having functions of 
signal transduction-associated proteins by using their partial nucleotide sequences. In the 81 clones categorized into 
transcription-associated proteins by using their full-length sequences. 57 clones were also predicted to encode proteins 
having functions of transcription-associated proteins by using their partial nucleotide sequences. In the 85 clones cat- 
egorized into disease-associated proteins by using their full-length sequences, 6 clones were also predicted to encode 

45 proteins having functions of disease-associated proteins by using their partial nucleotide sequences. The number of 
clones, which were predicted to encode disease-associated proteins based on the full-length nucleotide sequences, 
is much greater than that predicted based on the partial sequences. The reason is that the full-length sequences were 
categorized by using the data found in the OMIM database into the category of disease-associated proteins. 
[0053] In some cases, the predicted functions based on the partial sequences are different from those based on the 

50 full-length sequences. The reason is that a protein does not always belong solely to a single category of the above- 
described functional categories, and therefore, it is possible for the protein to belong to both of the predicted functional 
categories. Besides, additional functions can be found for the clones classified into these functional categories by 
further analyses. 

[0054] The following list shows the cDNA clones predicted and selected on the basis of the partial sequences (5' 
55 sequences) as cDNAs encoding secretory and/or membrane proteins, glycoprotein-associated proteins, signal trans- 
duction-associated proteins, transcription-associated proteins, or disease-associated proteins. 
[0055] The clones that are selected by the score in the ATGpr and by the PSORT for the existence of a signal 
sequence can be expected to encode a secretory or membrane protein since they are predicted to possess the secretion 
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signal or a transmembrane region. The clones that are selected by the score in the ATGpr and by the PSORT for the 
existence of a signal sequence are listed below (254 clones). 

HEMBA1000300 HEMBA10Q0713 HEMBA1000907 
5 HEMBA1000962 HEMBA1001272 HEMBA1001297 

HEMBA1002164 HEMBA1 002239 HEMBA1 002420 

HEMBA1 002421 HEMBA1003101 HEMBA1 003294 

HE MBA 1003399 HEMBA1 003602 HEMBA1 003732 

HEMBA1004110 H EM BA 1004797 HEMBA1 005430 
10 HEMBA1006016 HEMBA1006171 HEMBA1006311 

HEMBA1 006335 HEMBA1 006357 HEMBA1 006572 

HEMBA1006658 HEMBA1006707 HEMBA1006902 

HEMBA1006960 HEMBA1007013 HEMBB1000276 

HEMBB1 000447 HEMBB1 000567 HEMBB1 000642 
is HEMBB1000905 HEMBB1001200 HEMBB1001407 

HEMBB1001530 HEMBB1001547 HEMBB1 001978 

HEMBB1002162 HEMBB1 002228 HEMBB1 002245 

HEMBB1 002427 HEMBB1 002465 HEMBB1 002663 

HEMBB1 002693 MAMMA1 000046 MAMMA1000102 
20 MAMMA1000118 MAMMA1000141 MAMMA1000449 

MAMMA1 000457 MAMMA1 000591 MAMMA1 000652 

MAMMA1 000681 MAMMA1 000986 MAMMA1 000994 

MAMMA1001043 MAMMA1001141 MAMMA1 001284 

MAMMA1001310 MAMMA1001344 MAMMA1001893 
25 MAMMA1001901 MAMMA1001957 MAMMA 1002070 

MAMMA1 002087 MAMMA1002165 MAMMA1 002205 

MAMMA1 002224 MAMMA1 002633 NT2RM2000241 

NT2RM2000306 NT2RM2000410 NT2RM2000514 

NT2RM2001643 NT2RM2001941 NT2RM4000115 
30 NT2RM4000997 NT2RM4001 321 NT2RM4001 325 

NT2RM4001768 NT2RP1 000050 NT2RP1 000448 

NT2RP1 000903 NT2RP1001563 NT2RP2000479 

NT2RP2001495 NT2RP2001915 NT2RP2001948 

NT2RP2002015 NT2RP2002063 NT2RP2002304 
35 NT2RP2002674 NT2RP2002721 NT2RP2003383 

NT2RP2003469 NT2RP2003593 NT2RP2003599 

NT2RP2003655 NT2RP2003664 NT2RP2004179 

NT2RP2004447 NT2RP2004495 NT2RP2004524 

NT2RP2004556 NT2RP2004837 NT2RP2005027 
40 NT2RP2005463 NT2RP200551 4 NT2RP2005887 

NT2RP2006042 NT2RP2006269 NT2RP3000169 

NT2RP3000460 NT2RP3000481 NT2RP3000645 

NT2RP3000789 NT2RP3000818 NT2RP3001012 

NT2RP3001044 NT2RP3001195 NT2RP3001560 
45 NT2RP3001685 NT2RP3001858 NT2RP3002160 

NT2RP3002281 NT2RP3002721 N12RP3002836 

NT2RP3002958 NT2RP3003076 NT2RP3003354 

NT2RP3003469 NT2RP3003535 NT2RP3003559 

NT2RP3003963 NT2RP3004000 NT2RP3004083 
50 NT2RP30041 33 NT2RP3004309 NT2RP3004321 

NT2RP3004355 NT2RP3004374 NT2RP4001001 

NT2RP4001879 NT2RP4002451 NT2RP4002715 

OVARC1000208 OVARC1000298 OVARC1 000439 

OVARC 1000775 OVARC1 000811 OVARC 1000853 
55 OVARC1001222 OVARC1001727 OVARC1001807 

OVARC1001833 PLACE1 000231 PLACE1000560 

PLACE1 000740 PLACE1000912 PLACE1000914 

PLACE1 000927 PLACE1 000986 PLACE1001100 
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PLACE1001183 PLACE"! 001 229 PLACE1001407 
PLACE1001536 PLACE1001788 PLACE1 002080 
PLACE1 002095 PLACE1 002374 PLACE1002518 
PLACE1 003407 PLACE1003428 PLACE1 003460 

5 PLACE1 003839 PLACE1 003845 PLACE1 004028 

PLACE1004199 PLACE1 004282 PLACE 1004305 
PLACE1 004482 PLACE1 004637 PLACE1 005005 
PLACE1 005250 PLACE1 005383 PLACE1 005410 
PLACE 1005544 PLACE 1005569 PLACE 1005601 

10 PLACE1 005660 PLACE1 005669 PLACE1 005725 

PLACE1 005768 PLACE1 005927 PLACE1 006079 
PLACE1 006093 PLACE1 00621 9 PLACE1 006277 
PLACE1 006443 PLACE1 006786 PLACE1 006809 
PLACE1 007040 PLACE1 007096 PLACE1 007296 

15 PLACE1 007626 PLACE1 007971 PLACE1 008469 

PLACE1 008984 PLACE1 008985 PLACE1 009067 
PLACE1009196 PLACE1 009527 PLACE1 009982 
PLACE1010251 PLACE1011236 PLACE2000219 
PLACE4000455 SKNMC1000004 SKNMC1000014 

20 THYR01 000036 THYR01 000099 THYR01 0001 96 

THYR01 000795 THYR01 000999 THYR01 001 237 
THYR01 001 327 THYR01 001 478 THYR01 001 495 
THYR01 001 523 THYR01 001 702 THYR01 001 725 
Y79AA1 000226 Y79AA1 000270 Y79AA1 000426 

25 Y79AA1 000521 Y79AA1 000776 Y79AA1 000959 

Y79AA1001013 Y79AA1001056 Y79AA1001264 
Y79AA1001328 Y79AA1001427 Y79AA1001430 
Y79AA1 001530 Y79AA1001592 Y79AA1001793 
Y79AA1001795 Y79M 1001 803 Y79AA1001863 

30 Y79AA1 002022 Y79AA1 002373 



[0056] In the example mentioned below, the 254 clones as described above were categorized into three groups 
according to their maximal value in the ATGpr and the result in the PSORT, which are shown in Table 7-10,11,12 (246 
clones), and Table 13, 14, 15 (8 clones). In the tables, the name of clone, indicate the name of the clone that was 

35 selected by the ATGpr and the PSORT; the name of sequence indicates the name of the 5'-end sequence of the clone 
on the left; the maximal ATGpr score indicates the maximal ATGprl score of the 5'-end sequence shown on the left; 
and signal indicates the presence of the signal sequence according to the prediction by the PSORT. In addition, the 
representative sequence is the sequence that has the longest sequence among the cluster in which the 5'-end sequence 
on the left was included. The maximal ATGpr score and signal on the right indicate the maximal ATGprl score of the 
representative sequence, and the presence of a signal sequence in the representative sequence according to the 
prediction by the PSORT respectively. The 170 clones shown in Table 7-10, having the maximal score in the ATGprl 
higher than 0.5, and predicted to possess a signal sequence by the PSORT, are very likely to be full-length and encode 
a secretory or membrane protein. The 35 clones in Table 11 , which have the maximal score in the ATGprl 0.3 or higher 
and less than 0.5, and predicted to have a signal sequence, are also as well. And, the 41 clones in Table 12, having 

45 the maximal score in the ATGprl 0 or higher and less than 0.3, and predicted to have a signal sequence, are likely to 
be full-length and encode a secretory or membrane protein. 

[0057] The 8 clones in Table 13 (4 clones), Table 14 (2 clones), and Table 15 (2 clones) have the maximal score in 
the ATGprl 0.5 or higher, 0.3 or higher and less than 0.5, and 0 or higher and less than 0.3, respectively, and are 
predicted to have no signal sequence by the PSORT However, these clones contain a region that is recognized by 
50 the PSORT to be a signal sequence within the representative sequence composing the same cluster. Thus, the clones 
were judged as a full-length clone which encodes a membrane protein, especially. 

[0058] The clones selected by the score in the ATGpr and by the keywords in the top hit data in the SwissProt are 
likely to encode a secretory or membrane protein, or proteins with functions associated to signal transduction, glyco- 
protein, transcription, and diseases according to the respective keywords. These 659 clones are shown below. Here, 
55 top hit data is defined to be data of known amino acid sequence which is identified to be the most homologous sequence 
in homology search using the SwissProt. 



BNGH41 000020 BNGH41 000087 BNGH41 000091 
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HEMBA1000006 HEMBA1000121 HEMBN1000128 
HEM BA 1000275 HEMBA1 000349 HEMBA1 000443 
HEM BA1 000462 HEMBA1 000477 HEMBA1 000590 
HEMBA1 000634 HEMBA1 000671 HEMBA1 000732 
HEMBA1000745 HEMBA1000835 HEMBA1000875 
HEMBA1000907 HEMBA1000940 HEMBA1001184 
HEMBA1001221 HEMBA1001228 HEMBA1001296 
HEMBA1001390 HEMBA1001563 HEMBA1001621 
HEMBA1001878 HEMBA1001886 HEMBA1 002048 
HEMBA1002131 HEMBA1002163 HEMBA1002164 
HEMBA1002167HEMBA1002178HEMBA1002195 
HEMBA1 002227 HEMBA1002316 HEMBA1 002421 
HEMBA1002524 HEMBA1002551 HEMBA1002767 
HEMBA1 002985 HEMBA1 002992 HEMBA1 003047 
HEMBA1 003072 HEMBA1003101 HEMBA1 003120 
HEMBA1 003230 HEMBA1003315 HEMBA1 003392 
HEMBA1 003487 HEMBA1 003497 HEMBA1 003530 
HEMBA1 003945 HEMBA1 004007 HEMBA1 004067 
HEMBA1 004085 H EM BA 1004250 HEMBA1 004391 
HEMBA1 004444 HEMBA1 004454 HEMBA1 004505 
HEMBA1 004785 HEMBA1 004797 HEMBA1 004952 
HEMBA1 004971 HEMBA1 004982 HEMBA1 005070 
HEMBA1 005084 HEMBA1005145 HEMBA1 005230 
HE MBA 1005246 HEMBA1 005267 HEMBA1 005337 
HEMBA1 005449 HEMBA1 005489 HEMBA1 005522 
HEMBA1 005545 HEMBA1 005698 HEMBA1005913 
HEMBA1 005929 HEMBA1 005945 HEMBA1 006276 
HEMBA1 006299 HEMBA1 006335 HEMBA1 006430 
HEMBA1 006482 HEMBA1 00651 7 HEMBA1 006544 
HEMBA1 006572 HEMBA1 006707 HEMBA1 006724 
HEMBA1 006749 HEMBA1 006770 HEMBA1 006902 
HEMBA1006912 HEMBA1006916 HEMBA1007013 
HEMBA1 007057 HEMBA1 007063 HEMBA1 007226 
HEMBA1 007241 HEMBA1 007291 HEMBA1 007332 
HEMBB1000106 HEMBB1000309 HEMBB1000407 
HEMBB1 000447 HEMBB1 000542 HEMBB1 000567 
HEMBB1 000668 HEMBB1 000679 HEMBB1 000881 
HEMBB1001026 HEMBB1001048 HEMBB1001200 
HEMBB1001573 HEMBB1001847 HEMBB1 001959 
HEMBB1 002039 HEMBB1 002041 HEMBB1 002051 
HEMBB1002120 HEMBB1002302 HEMBB1002427 
HEMBB1 002661 MAMMA1000106 MAMMA1 000204 
MAMMA1 000226 MAMMA1 000403 MAM MA1 000473 
MAMMA1 000496 MAMMA1 000528 MAMMA1 000591 
MAMMA1000614 MAMMA1000681 MAMMA1000706 
MAMMA1 000788 MAMMA1 000810 MAMMA1000814 
MAMMA1000881 MAMMA1001043 MAMMA1001066 
MAMMA1001094 MAMMA1001150 MAMMA1001237 
MAMMA1001418 MAMMA1001532 MAMMA1001609 
MAMMA1001615 MAMMA1001623 MAMMA1001634 
MAMMA1001893 MAMMA1001957 MAMMA1001978 
MAMMA1002070 MAMMA1 002080 MAMMA1 002091 
MAMMA1002095 MAMMA1002128 MAMMA1002142 
MAMMA1002165 MAMMA1 002234 MAMMA1 002586 
MAMMA1 002633 MAMMA1003126 NT2RM 1000407 
NT2RM1 000462 NT2RM1000542 NT2RM1 000580 
NT2RM1000789 NT2RM1000855 NT2RM1000858 
NT2RM 1000899 NT2RM2000410 NT2RM2000423 
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NT2RM2000497 NT2RM2000565 NT2RM2000582 
NT2RM2000589 NT2RM2000622 NT2RM2000632 
NT2RM2000773 NT2RM2001126 NT2RM2001558 
NT2RM2001626 NT2RM2001738 NT2RM2001767 

5 NT2RM2001 792 NT2RM2001 81 8 NT2RM2001 902 

NT2RM2001939 NT2RM2001941 NT2RM4000100 
NT2RM4000198 NT2RM4000284 NT2RM4000295 
NT2RM4000326 NT2RM4000417 NT2RM4000444 
NT2RM4000587 NT2RM4000593 NT2RM4000648 

10 NT2RM4000761 NT2RM4000965 NT2RM4001 377 

NT2RM4001735 NT2RM4001843 NT2RM4002352 
NT2RP1 000002 NT2RP1 000050 NT2RP1000181 
NT2RP1 000239 NT2RP1 000261 NT2RP1 000271 
NT2RP1 000300 NT2RP1 000325 NT2RP1 000465 

15 NT2RP1 000468 NT2RP1 000551 NT2RP1000579 

NT2RP1000613 NT2RP1 000679 NT2RP1 000740 
NT2RP1 000981 NT2RP1001004 NT2RP1001020 
NT2RP1001031 NT2RP2000092 NT2RP2000178 
NT2RP2000240 NT2RP2000394 NT2RP2000447 

20 NT2RP2000514 NT2RP2000533 NT2RP2000610 

NT2RP2000616 NT2RP2000649 NT2RP2000663 
NT2RP2000694 NT2RP2000712 NT2RP2000739 
NT2RP2000818 NT2RP2000903 NT2RP2001200 
NT2RP2001223 NT2RP2001276 NT2RP2001388 

25 NT2RP2001469 NT2RP2001480 NT2RP2001495 

NT2RP2001514 NT2RP2001529 NT2RP2001538 
NT2RP2001562 NT2RP2001662 NT2RP2001755 
NT2RP2001769 NT2RP2001817 NT2RP2001878 
NT2RP2001903 NT2RP2001921 NT2RP2001948 

30 NT2RP2001956 NT2RP2002063 NT2RP2002188 

NT2RP2002232 NT2RP2002304 NT2RP2002409 
NT2RP2002510 NT2RP2002527 NT2RP2002533 
NT2RP2002564 NT2RP2002824 NT2RP2002942 
NT2RP2002974 NT2RP2002976 NT2RP2003042 

35 NT2RP2003138 NT2RP2003179 NT2RP2003210 

NT2RP2003302 NT2RP2003369 NT2RP2003390 
NT2RP2003469 NT2RP2003545 NT2RP2003593 
NT2RP2003655 NT2RP2003664 NT2RP2003931 
NT2RP2003940 NT2RP2003950 NT2RP2004069 

40 NT2RP2004108 NT2RP2004141 NT2RP2004205 

NT2RP2004447 NT2RP2004606 NT2RP2004648 
NT2RP2004670 NT2RP2004794 NT2RP2004847 
NT2RP2005069 NT2RP2005163 NT2RP2005181 
NT2RP2005247 NT2RP2005378 NT2RP2005391 

45 NT2RP2005425 NT2RP2005535 NT2RP2005541 

NT2RP2005597 NT2RP2005632 NT2RP2005666 
NT2RP2005774 NT2RP2005878 NT2RP2005883 
NT2RP2005941 NT2RP2005994 NT2RP2006004 
NT2RP2006042 NT2RP2006092 NT2RP2006099 

50 NT2RP2006134 NT2RP2006269 NT2RP2006512 

NT2RP3000011 NT2RP3000022 NT2RP3000059 
NT2RP3000063 NT2RP3000125 NT2RP3000148 
NT2RP3000171 NT2RP3000172 NT2RP3000201 
NT2RP3000232 NT2RP3000304 NT2RP3000378 

55 NT2RP3000427 NT2RP3000436 NT2RP3000444 

NT2RP3000481 NT2RP3000616 NT2RP3000645 
NT2RP3000652 NT2RP3000676 NT2RP3000677 
NT2RP3000721 NT2RP3000820 NT2RP3000838 
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NT2RP3000871 NT2RP3Q00907 NT2RP3000921 
NT2RP3001012NT2RP3001061 NT2RP3001159 
NT2RP3001170 NT2RP3001195 IMT2RP3001240 
NT2RP3001271 NT2RP3Q01322 NT2RP3001388 
5 NT2RP3001542 NT2RP3001560 NT2RP3001592 

NT2RP3001650 NT2RP3001738 NT2RP3001754 
NT2RP3001976 NT2RP3002015 NT2RP3002160 
NT2RP3002286 NT2RP3002311 NT2RP3002324 
NT2RP3002342 NT2RP3002353 NT2RP3002409 
" NT2RP3002411 NT2RP3002448 NT2RP3002571 

NT2RP3002664 NT2RP3002737 NT2RP3002738 
NT2RP3002790 NT2RP3002836 NT2RP3002887 
NT2RP3002900 NT2RP3002958 NT2RP3002983 
NT2RP3003000 NT2RP3003076 NT2RP3003354 
'5 NT2RP3003448 NT2RP3003473 NT2RP3003527 

NT2RP3003532 NT2RP3003614 NT2RP3003729 
NT2RP3003849 NT2RP3003874 NT2RP3003939 
NT2RP3004025 NT2RP3004067 NT2RP3004075 
NT2RP3004090 NT2RP3004119 NT2RP3004130 
NT2RP3004133 NT2RP3004202 NT2RP3004294 
NT2RP3004309 NT2RP3004345 NT2RP3004406 
NT2RP3004481 NT2RP3004552 NT2RP3004557 
NT2RP3004625 NT2RP3004640 NT2RP3004647 
NT2RP4000108 NT2RP4000634 NT2RP4000962 
2 * NT2RP4001009 NT2RP4001467 NT2RP4001877 

NT2RP4001879 NT2RP4002187 NT2RP4002451 
NT2RP4002750 OVARC1 000003 OVARC1 000090 
OVARC1000105 OVARC1000137 OVARC1 000255 
OVARC 1000275 OVARC1 000307 OVARC1000313 
30 OVARC1 000331 OVARC 1000410 OVARC1 000439 

OVARC1 000467 OVARC1000529 OVARC1 000553 
OVARC1000873 OVARC1000916 OVARC1000956 
OVARC1000995 OVARC1001030 OVARC1001049 
OVARC1001086 OVAPC1001132 OVARC1001163 
35 OVARC 1001 222 OVARC1001260 OVARC1001336 

OVARC 1001 338 OVARC1001569 OVARC1 001570 
OVARC1001596 OVARC1001607 OVARC1001725 
OVARC1001952 OVARC1001991 OVARC1002058 
OVARC1002178 PLACE1000033 PLACE1000258 
<0 PLACE1 000442 PLACE1 000740 PLACE1 000907 

PLACE1001016 PUCE1001114 PLACE1001123 
PLACE1001231 PLACE1001340 PLACE1001401 
PLACE1001407 PLACE1001464 PLACE1001500 
PLACE1001516 PUCE1001564 PLACE1001655 
PLACE1001795 PLACE1001836 PLACE1001918 
PLACE1001949 PLACE1 002080 PLACE1 002095 
PLACE1002153 PLACE1 002329 PLACE1 002355 
PLACE1 002374 PLACE1 002547 PLACE1 002726 
PLACE1 002905 PLACE1 002911 PLACE1 002967 
PLACE1003135 PLACE1 003163 PLACE1 003428 
PLACE1003438 PLACE1003460 PLACE1003529 
PLACE1 003573 PLACE1 003598 PLACE1 003644 
PLACE1003737 PLACE1 003772 PLACE1 003852 
PLACE1 004078 PLACE1004166 PLACE1004168 
55 PLACE1004279 PLACE1004441 PLACE1004450 

PLACE1004482 PLACE1004492 PLACE1004519 
PLACE 1004520 PLACE1 004630 PLACE1 004648 
PLACE1004816 PLACE1 004887 PLACE1 005003 



45 



50 



34 



EP1 130 094 A2 



PLACE1 005031 PLACE1 005239 PLACE1 005383 
PLACE1 005426 PLACE1005519 PLACE1 005539 
PLACE1 005544 PLACE1 005569 PLACE1 005682 
PLACE1 005736 PLACE1 005745 PLACE1005815 

5 PLACE1 005878 PLACE1 005927 PLACE1 006071 

PLACE1 006073 PLACE1 006208 PLACE1 006277 
PLACE1006290 PLACE1006443 PLACE1006515 
PLACE1006716 PLACE1 006959 PLACE1 007028 
PLACE1 007077 PLACE1 007081 PLACE1 007096 

10 PLACE1007296 PLACE1007591 PLACE1007702 

PLACE1007845 PLACE1007881 PLACE1008282 
PLACE1 008297 PLACE1 008359 PLACE1 008469 
PLACE1 008549 PLACE1 008657 PLACE1008716 
PLACE1 008744 PLACE1 008984 PLACE1 008985 

'5 PLACE1 009279 PLACE1 009527 PLACE1 009546 

PLACE1 009600 PLACE1 009735 PLACE1010011 
PLACE1010078 PLACE1010081 PLACE1010251 
PLACE1010445 PLACE1010713 PLACE1010784 
PLACE1010827 PLACE1010968 PLACE1011045 

20 PLACE1011116 PLACE1011181 PLACE1011236 

PLACE1011364 PLACE1011407 PLACE1011516 
PLACE1011708 PLACE1011824 PLACE1011978 
PLACE2000118 PLACE3000181 PLACE3000213 
PLACE4000354 SKNMC1000014 SKNMC1 000082 

25 THYR01 000061 THYRO1000196 THYR01 000400 

THYR01 000580 THYR01 000584 THYR01 000678 
THYR01 000776 THYR01 000795 THYR01 000846 
THYR01 000866 THYR01 000956 THYR01 000964 
THYR01 001 063 THYR01 001 071 THYR01 001 1 02 

30 THYRO1001113THYRO1001128THYRO1001205 
THYR01 001 242 THYR01 001 266 THYR01 001 456 
THYR01 001 457 THYR01 001 471 THYR01 001 478 
THYR01 001 529 THYR01 001 593 THYR01 001 608 
THYR01 001 641 THYR01 001 700 THYR01 001 702 

35 THYR01 001 770 THYR01 001 803 Y79AA1 000030 

Y79AA1 000127 Y79AA1 000207 Y79AA1 000270 
Y79AA1 000426 Y79AA1 000750 Y79AA1 000777 
Y79AA1 000876 Y79AA1 000888 Y79AA1 000967 
Y79AA1001062 Y79AA1001090 Y79AA1001212 

40 Y79AA1001272 Y79AA1001426 Y79AA1001523 

Y79AA1001727 Y79AA1001787 Y79AA1001799 
Y79AA1001803 Y79AA1001863 Y79AA1 002058 
Y79AA1002121 Y79AA1002129 Y79AA1002213 
Y79AA1 002334 Y79A A 1002376 Y79AA 1002378 

45 Y79AA1 002381 NT2RP2006580 



[0059] Among the clones, the following 83 clones are identical to the clones selected by the score in the ATGpr and 
the prediction by the PSORT for the existence of a signal sequence. 



HEMBA1 000907 
HEMBA1002164 
HEMBA1 002421 
HEMBA1003101 
HE MBA 1004797 
HE MBA 1006335 
HEMBA1 006572 
HEMBA1 006707 
HEMBA1 006902 



NT2RM2000410 
NT2RM2001941 
NT2RP 1000050 
NT2RP2001495 
NT2RP2001948 
NT2RP2002063 
NT2RP2002304 
NT2RP2003469 
NT2RP2003593 



PLACE 1000740 
PLACE 100 1407 
PLACE 1002080 
PLACE 1002095 
PLACE 1002374 
PLACE 1003428 
PLACE 1003460 
PLACE 1004482 
PLACE 1005383 
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HEMBA1007013 NT2RP2003655 PLACE1005544 
HEMBB1 000447 NT2RP2003664 PLACE 1005569 
HEMBB1 000567 NT2RP2004447 PLACE1 005927 
HEMBB1001200 NT2RP2006042 PLACE 1006277 
HEMBB1 002427 NT2RP2006269 PLACE1 006443 
MAMMA1 000591 NT2RP3000481 PLACE1 007096 
MAMMA1 000681 NT2RP3000645 PLACE1 007296 
MAMMA1001043 NT2RP3001012 PLACE1 008469 
MAMMA1001893 NT2RP3001195 PLACE1 008984 

MAMMA1001957 NT2RP3001560 PLACE1 008985 

MAMMA1 002070 NT2RP3002160 PLACE1 009527 

MAMMA1002165 NT2RP3002836 PLACE1010251 

MAMMA1 002633 NT2RP3002958 PLACE1 011236 

NT2RP3003076 SKNMC 1000014 

NT2RP3003354 THYPO1000196 

NT2RP30041 33 THYR01 000795 

NT2RP3004309 THYRO1001478 

NT2RP4001 879 THYR01 001 702 

NT2RP4002451 Y79AA1 000270 

OVARC1000439 Y79AA1000426 

OVARC 1001 222 Y79AA 100 1803 

Y79AA1001863 



[0060] The 446 clones in Table 16,17,18,19, and 20, and NT2RP2006580 are predicted to encode a secretorv or 
membrane prote.n. Among them : 77 clones were identical to the clones selected by th ^ score" ~? A TG^he 

?S "t^^ 6XiStenCe ° f 3 Si9na ' SGqUenCe 4 of tSllL^S ,n Table 

7-15). Besides many clones were turned out to be identical to the clones selected as a protein associated J rth 1 

SET The 2? I 1 "" 6 " r:! 0 ™ identk:a, t0 th ° Se S6,eCted 35 3 ^ Stated XZe^ ^ * 

^ 7£?tZ?£ Pred J te K d t0 enC ° de 3 9,yCOpr0tein ' Am ° ng them ' 53 cIones •» ^tica. to 
And mQ ^ V ATGpr and thG P rediction b V PSORT for the existence of a signal sequence 

And many clones were turned out to be identical to the clones selected as a secretory or membrane prZ ZZer 
thena were clones identical to those selected as a protein associated with a disease P ' 

mm TUe TsoTnl ^ T fS PrediCtGd t0 enC ° de a Pr ° tein aSS0Clated t0 transduction. 

E Th! f , Vt* 23 ^ PredlCt6d t0 GnCOde a protein associated to transcription. 

522 J h l 1 7 ° ] T S m ab ' e 24 9re PredlCted t0 encode a P rotein associa ted with diseases 

[0065] In these clones, 532 clones have the maximal ATGprl score of 0.5 or higher (Table 25) 60 clones have thP 

J™ £? r ! SC ° re I 0 ' 3 " hi9her and ,GSS than « C™e 26 -nd NT2RP2006580^d fZT^ereZ 
maximal ATGpM score of 0 or higher and less than 0.3 (Table 27) 

K?i 2 C,0 r S ^ Tab,G 25 ' e3Ch haVlng the maximal score jn the ATGprl 0.5 or higher are very likelv to 

be fun-length and encode a secretory or membrane protein, or proteins associated to signal tnmKn q^Si? 

0 3 Se n r 11 TT 5 n f neS " Tab,e 26 and NT2RP20 °6580, which have th.^axJ^^S^S' 
0.3 or h gher and less than 0.5, are likely to be full-length and encode a secretory or membrane protein orZfZl 
associated to s.gnal transduction, glycoprotein, transcription, or diseases. 67 clones in tSTt l^*?JSS 
= , n theATGpr1 Oorhigher and less tha^ 

moSl ' T aSS °' ,ated t0 Sl9nal transduction ' g^oprotein, transcription, or diseases * 

EITL S ? me Se,eCt,n9 CDNA C '° neS prediCted t0 encode secr *tory and/or transmembrane pro- 

the orlin T em S h 6 C ° mPlete amin ° SeqUenCe ' " fe p0SSible t0 anal V ze logical aS^JSE 

SSL " 18 T*'? 5 ana ' yZe bi0,09iCal aC,iVity ° f 3 SeCreto, V P ro,ein ° r a membrane protein or proteins asso- 
rted to signal transduction, glycoprotein, or transcription, based on the methods in "Gene Transcrtot on° 
D., and H,gg ns S.J. edit, (1993)), "Glycobiology" (Fukuda M., and Koba.a A. a«Tl^SS^^S2 
and Le,gh I. ed-t (1 993)), "Extracellular Matrbc" (Haralson M.A., and Hassell J.R. edrt, (i^JSS F» 
(Latchman D.S. ed,«, (1993)), "Signal Transduction" (MiHigans G. edit, (1992)), featured in t7 ^SESSSS. 
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Series" (IRL PRESS), or "Signal Transduction Protocols" (Kendall D.A., and Hitl S.J. edit, (1 995), "Glycoprotein Analysis 
in Biomedicine" (Hounsell E.F. edit, (1993)), featured in "Method in Molecular Biology" (Humana Press). 
[0070] As to a protein associated with a disease, it is possible to perform a functional analysis as described above, 
but also possible to analyze correlation between the expression or the activity of the protein and a certain disease by 
5 using a specific antibody that recognizes the protein. Alternatively, it is possible to utilize the database Online Mendelian 
Inheritance in Man (OMIM) (http://www.ncbi.nlm.nih.gov/Omim/) , which is a database of human genes and diseases, 
to analyze the protein. New information is constantly being deposited in the OMIM database. Therefore, it is possible 
for one skilled in the art to find a new relationship between a particular disease and a gene of the present invention in 
the updated database. 

10 [0071] Proteins associated with diseases are useful in drug development as they can be utilized as a diagnostic 
marker, a drug that regulates the level of their expression and activities, or a target of gene therapy. Also, as for a 
secretory protein, membrane protein, or proteins associated with signal transduction, glycoprotein, or transcription, 
search of the OMIM with the keywords mentioned below revealed that the proteins are associated with many diseases. 
Also, relationship between a proteins associated to signal transduction or transcription and diseases is reported in 

15 "Transcription Factor Research- 1999" (Fujii, Tamura, Morohashi, Kageyama, and Satake edit, (1999) Jikken-lgaku 
Zoukan, Vol.17, No. 3), and "Gene Medicine" ((1999) Vol.3, No. 2). Thus, not only a protein associated with diseases, 
but also a secretory protein, membrane protein, or protein associated with signal transduction, glycoprotein, or tran- 
scription is involved in diseases, suggesting these proteins also are very important as a target in medical industry. 
[0072] Keywords used in the search of the OMIM 

20 

(1) secretion protein 

(2) membrane protein 

(3) channel 

(4) extracellular matrix 
25 (5) receptor 

(6) glycoprotein 

(7) protein kinase 

(8) calmodulin kinase 

(9) transcription factor 

30 

[0073] Shown in the search result are only the accession numbers in the OMIM. Using the number, data showing 
the relationship between a disease and a gene or protein can be seen. The OMIM data has been renewed everyday. 

1) Secretion protein 

35 268 entries found, searching for "secretion protein" 

104760, 176860, 160900, 107400, 118910, 139320, 603850, 147572, 176880,600946, 603215, 157147,600174, 
151675, 170280, 179512, 179513, 138120, 179509, 246700, 179510, 600626, 179511 , 600998, 109270, 601489, 
154545, 179490, 185860, 603216, 122559, 601746, 147290, 602672, 146770, 603062, 179508, 131230, 601591 , 
602421, 139250, 167805, 167770, 600041 , 600564, 118825, 601146, 300090, 600753, 601652, 600759, 600768, 

40 602434, 182590,603166,308230,602534,603489, 107470, 150390, 104610, 173120, 158106, 143890,306900, 

308700, 134797, 137350,227500, 176300, 107730, 600760, 138079, 120180, 120160, 120150, 124092, 138160, 
101000, 227600, 600509, 601199, 142410, 104311, 193400,201910, 107300, 122560,272800,217000,590050, 
147670, 133170, 176730, 300300, 134370, 274600, 120140, 162151, 158070, 

152790, 120120, 106100, 300200, 192340, 190160, 138040, 147470, 147620, 173350, 147380, 152200, 152760, 
45 157145, 153450, 264080, 113811 , 600937, 600840, 188545, 202110, 600514, 186590, 603372, 136435, 137241, 

252800, 214500, 207750, 138850, 139191 , 142640, 138130, 189907, 603692, 600633, 603355, 107270, 600377, 
147892, 232200, 600281 , 232800, 602358, 137035, 601771 , 601769, 253200, 601933, 118444, 600270, 120700, 
600945, 603732, 147660, 600761, 172400, 600823, 600877, 130080, 171060, 107740, 307800, 602843, 130660, 
152780, 124020, 601124, 601340, 601604, 601610, 171050, 312060, 232700, 300159, 142703, 600734, 125255, 
50 168450, 123812, 188540, 147940, 188450, 600839, 182452, 188400, 182280, 176760, 263200, 600264, 188826, 

252650, 601185, 1 62641 , 137216, 601398, 601538, 118888, 118445, 601745, 

190180, 601922, 182098, 602008, 147440, 602384, 600031 , 109160, 602663, 151670, 602682, 602730, 602779, 
146880, 603061 , 142704, 603140, 106150, 600732, 153620, 603318, 139392, 600042, 102200, 603493, 182100, 
264300, 603795, 184600 

55 

2) Membrane protein 

1017 entries found, searching for "membrane protein" 

130500,305360, 153330, 173610, 170995, 109270, 170993,309060, 120920,602333, 133740, 133710,602690, 
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133730,159430,600897,133090,601178,602413,602003,109280 603237 6021 71 m777fi xno™ 

S333333SS=S33 

iHiiiilpai 

S 2£o 1072 8 71 ' J£S ^ ^ ^ 16 ^30, 600874, 113730. 155550. 602701 306400 
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193065, 193001, 191306, 600504, 601020, 191191 , 602372, 190181, 600534, 188380, 186854, 186360, 600530, 
185250, 182331 , 600535, 182305, 601296, 600582, 600732, 600734, 600742, 600782, 176802, 176266, 600769, 
601883, 600864, 601901, 176260, 173490, 600910, 601905, 171890, 600916, 601987, 602679, 162651 , 161555, 
160994, 602714, 602715, 602724, 602736, 300007, 602783, 275630, 602836, 270200, 602871 , 159460, 602876, 

5 154540, 153900, 602890, 601153, 602190, 602905, 153634, 153337, 602914, 

152310, 151690, 151625, 602935, 602974, 150325, 602992, 150320, 250790, 603006, 603007, 603008, 150292, 
233690, 603046, 150210, 603061, 147940, 603063, 221770, 223100, 603097, 147880, 603118, 147730, 146928, 
146630, 142622,603149, 603150,603151,600923, 138981 , 138590, 138330,216950, 603192, 138297,603202, 
601002, 602343, 138230, 136131 , 60321 7, 603220, 134660, 131390, 131235, 603242, 603243, 130130, 602345, 

10 126455, 601123, 126064, 125240, 602359, 603312, 602380, 603318, 123890, 123836, 603356, 603361 , 603366, 

123830, 179610, 188060, 123620, 120980, 186355, 118510, 114835, 114217, 113810, 603499, 182310, 111740, 
109610, 603548, 603564, 108740, 603598, 603613, 107273, 603626, 602518, 179410, 603647, 602515, 603652, 
106195, 602573, 178990, 105210, 104615, 167055, 603717, 104614, 603728, 

104210, 603749, 603750, 103850, 602608, 603787, 603788, 603796, 173445, 103220, 102910, 102681 , 102670, 
15 102642, 603833, 173391, 102576, 102575, 171833, 102573, 101800, 603875, 601108 

3) Channel 

272 entries found, searching for "channel" 

176266, 600724, 170500, 182390, 123825, 114208, 114205, 601784, 114206, 600937, 114204, 603415, 600053, 
20 114209, 114207, 600760, 118425, 601011, 192500, 176261 , 600761 , 176260, 600359, 600228, 600877, 602235, 

300008, 182389, 182391 . 601328, 601534, 600504, 602323, 601958, 602780, 602781 , 601327, 601012, 600734, 
603208, 1 82392, 603220, 60321 9, 603888, 600054, 602232, 601 745, 603537, 602604, 603796, 30291 0, 602866, 
601 013, 602905, 602906, 600163, 152427, 180901 , 600702, 600308, 602754, 107776, 602024, 314555, 601949, 
600235, 602023, 176263,600681, 176265, 193245, 603305, 176258, 602983, 601219, 601141, 176267, 602343, 
25 602726, 138253, 176262, 600003, 600397, 602872, 138249, 600843, 600935, 600580, 600845, 602158, 602106, 

176264, 300110, 176257, 602717, 603493, 176268, 600932, 602727, 138254, 

603652, 300138, 602420, 600570, 600150, 603583, 602345, 603749, 601142, 176256, 600846, 138252, 602982, 
603787, 602836, 603788, 602566, 603651 , 602421 , 100690, 107777, 100725, 100710, 600509, 603061 , 154275, 
304040, 154276, 180902, 121014, 602368, 139311 , 601 383, 108745, 601313, 601042, 600131, 186360, 600109, 

30 600229, 600170, 603319, 601485, 118503, 180903, 602076, 124030, 601059, 601212, 601218, 147450, 600855, 

600919, 601154, 601157, 171060, 600968, 182139, 131230, 121015, 600421 , 113730, 249210, 310500, 600637, 
125950, 118800, 156490, 602974, 104610, 121011 , 602522, 118504, 300041, 160900, 601382, 602103, 600465, 
602014, 600442, 601109, 602481 , 277900, 254210, 138247, 164920, 170280, 171050, 128100, 173910, 600884, 
123885, 602887, 600232, 180297, 137192, 600304, 138251 , 603053 : 300103, 

35 603152, 603199, 118511, 118508, 138079, 600983, 182307, 603324, 305990, 603418, 114080, 232200, 600046, 

600040, 602403, 603750, 603785, 104210, 600019, 600300, 182860, 603852, 603853, 603855, 516060 

4) Extracellular matrix 

167 entries found, searching for "extracellular matrix" 

40 603479, 602201, 601418, 601548, 154870, 115437, 602285, 602262, 602261 , 134797, 600754, 120361, 116935, 

602263, 603320, 601807, 603321, 185250, 1 85261 , 253700, 128239, 120324, 193300, 276901 , 308700, 600514, 
600261, 602109, 120140, 120150, 147557, 193400, 600536, 188826, 120180, 11 8661 , 120320, 152200, 135821, 
112260, 230740, 602090, 155760, 192975, 190182, 602108, 601463, 186745, 600900, 600985, 600758, 602369, 
179590, 601211, 600065, 602178, 600262, 182888, 182889, 151510, 182120, 150325, 190181, 150370, 186355, 

45 193065, 165070, 154705, 147559, 146650, 146640, 153619, 175100, 187380,231050, 188060, 135820, 156790, 

130660, 301870, 128240, 600076, 600119, 193210, 600215, 600245, 121010, 150240, 600309, 600491 , 222600, 
120328, 600564, 600596, 600616, 600700, 600742, 120325, 138297, 600930, 

156225, 601028, 601050, 601105, 253800, 601284, 601313, 120280, 310200, 601492, 120250, 601587, 601636, 
601652, 601692, 601728, 120220,601915, 602048, 155120,310300, 120210, 120165, 120120, 118940, 116930, 
50 602264, 11 6806, 602366, 120470, 602415, 602428, 602453, 602505, 602574, 603005, 603196, 603221 , 60331 9, 

107269, 216550, 103320, 603489, 603551 , 603767, 603799, 603842 

5) Receptor (including membrane proteins, and also including transcription factors, since nuclear proteins were 
not excluded in the search) 

55 1 606 entries found, searching for "receptor" 

600408, 176943, 107770, 601531 , 143890, 313700, 162643,202200, 147670, 191306, 182131 , 192321 , 138249, 
190120, 600264, 162321 , 603613, 603614, 602164, 138251 , 182134, 138040, 152790, 602393, 133430, 601769, 
304800, 147280, 168468, 147730, 155555, 191190, 109760, 122561, 136537, 602034, 147138, 187930,312861, 
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107450,138981,182135,603612,191191,600933,801267,600168,120650 182133 601919 108962 1fi?™ 
1 02576, 600022, 603366. 600052, 1 36430, 601 835. 600825, 600979 1 02981 92974' IS JSSZ 

601972.191315.602601.603611.602943.601300.191316 108960 ^^wf JS» SSlTSf' 
602392, 602910, 603372. 106165. 601199, 180901 , 131244. 107470 13643s' 46740 164920 139^' 3897 ' 
109690, 100690, 138850. 118504, 147781, 146928, 601373 126449 139191, 138971, 

US* loolf 5 ' 164342 ' 104220, 118503 ' 136538 ' 1 37241, 601663. 601805,306250. 600377 600337 600509 
600983, 126110, 118507, 602548. 162010, 601268, 147671, 601284 113503 109190 603826 S S ' 
600956. 138245. 139139. 155541, 137192, 118445, 104250 300119 851l' m » l£m £££ S 
114610 138253, 146929, 602778, 135630, 147140, 146790 147045 601998. 5 378 X 60004 ' JSf 
37142, 113995, 138247, 182137, 147811 , 182132, 138248, 600414, 600563, 13716o' 1 WSSS'lS 

0 22' 52? !S 1^' I 79590 " 1 809 ° 3 ' 1 ° 9635 ' 30 ° 038; 138244 > 18213 °: 22 S Si 0 296 
602282, 179611, 143090, 100710, 601076, 600380, 170998, 600845, 118505 118510 190160 ifldjwn 1-wirT 

137140, 600456. 147880, 137190. 601156, 601956, 100730, 601194 60OM7 ' 

JSX' 2SJ?' 60303 02 ' S!5S' ? 2469 ' 603028 ' 601437 ' ' 146933 ' ' 17961 °. "3510, 600843, 305915. 

llllZ' f ' 1 ' 1 371 41 ' 1 36539 ' 162332 ^ 11B509 . 15 5540. 600233. 601109 601743 600409 14713Q 
131235, 603506. 603455, 151445, 602408. 601681 , 603540, 602454. 602490, 600214 600798 600232' IZlll" 
600730, 601122. 120577, 160994, 603248. 602621, 601305, 176946 118502 176888 602167 SSS'SS 
601470, 603453. 603065, 137163, 137166, 600846, 118495 138243, 192975. 38060 sS JSf S 
20340, 600315, 136425, 118493, 600731. 601502, 601503, 601751 603651. 602345 11 8496 60 598 

68 0 S S 1= 6 °° 946 ' 18693 °' 18697 °- 13317 ^- 308380. 120620.' 22 9 .' S5S 
186810,180220,173410,188070,131550,109691,147679,180240 173490 

300034, 264080, 114131, 146661 , 147370, 147545, 136352, 190182. 186780. ^26453 110700 147851 3083B5 
126451, 128239, 600018, 601487, 601040, 305990, 601604, 190181 , 188545 16264?' HmS IS S' 

6 oI« 8 n !o^ 0 ' 600 ° 73 ' 602376 ' 138491 ' 118946 ' 176761 ' ^6806 601770 IN^L" SSJJ'SS'SSS' 
09610, 192977, 159530, 602997, 164860, 300007, 146710, 601757, 150370 118444 600239 138033 7 
108360. 162642. 173850.603881.180246. 193210.173470.601469 134935 W^'^Zl^'S^ 
601790, 600221 , 109630. 602451 , 1 47265, 602626. 602001 602069 601642 600848 02776 sS S 
176886, 602250, 182454, 600222, 600821, 180247, 602730, 134637 167055.' 165^' 65195 S £££ 
600527, 600167, 153618, 601330, 601970, 600785. 600004, 601043 109135 ' ' 

Inn!of 6 ^ 613 60 0804, 601529, 102581, 600936, 126150, 182451,300107, ^90151. 602746 165070 170650 
600421 , 00997, 602648, 603243, 300035, 305670, 182455, 162651 , 109535 601320 120920 603W llltv 
601535, 102775, 601166, 180381, 176804, 602188, 600051, 601993 601773 603755 600241 Si 050 toltll' 
600253, 300130, 137026, 602368. 154570, 176945, 602005, 120980 602583 10277^' SS m SS 
601 71 1 , 1 04221 , 1 76884, 600441 , 1 08361 , 600665, 600446. 600799 600742 1 04219 603819 SotS' lUlTl 
601 978, 600578, 602566, 600551 , 601 898, 1 51 443, 602642, 3001 82 1 91 1 64 602356 !£o71 ' S2I QO RmS ' 
182889, 142703, 601576, 1 00735. 1 76882, 602969. 182453 601895,' eS £ S 602697 60^ 70 12 So 
601108. 176891. 176802. 600024. 600552, 601908. 176981 , 603510 151460 ' 

602237,602545,162341,603748,161565,600894,186711,137161.601839 601592 603507 60288S 43nnnn 
104222, 601116, 602643, 601896, 601404, 601381, 600180, 600870 123889,' 600206 ££* S5 So 
602950, 602174, 603159, 602042, 602297, 602836, 602355, 602304 603822 147267 603030 603821 ' tllZl' 
603820, 60331 7, 601 965, 601524, 601 974, 603823, 602728 601 741 1 37570 6o2oo S S 
186990,601115,600283,182180,600600,600066,600282 176944 600926 600895 ^ 

2!S I' 2888, 6038 ° 8 ' 603 ° 29 ' 602702 ' 602464 ' 603751 • 425000 601 MO. 1 sS' eS Soil 95 602646 
601909, 603692, 136950, 600543, 164320. 300086, 602933, 602993 137162 603281 iS SX5 eSooti' 
600144, 600869. 600493, 602346, 600342, 602336. 113955, 602893 601577 ' 
803749 'f 02892 ' 60381 0- 6 0 285 3, 600579, 601044, 603809, 186690. 600752,603304 600240 603232 600868 
602894, 602337, 603143, 109545, 602891, 602890, 602716, 603142, 602510 603144 603141 So090 6^2550 
603145, 107741 , 107730, 164761, 208900, 107269, 114208, 235200 158120 147440 300200 S' S 
151520, 131240. 139250, 601667, 153240, 218030, 147680 186740 600044 07940 So O2S2 600978 
601218, 142409. 601156. 601884, 601789, 125950, 171833 602221 139320.' U7760 90000 SS 
165180, 166490, 168470, 601366. 176000, 191160. 102578, 103320 112260 192240 0*091 6032M 
254210, 173610, 125480,600957, 118945, 120470. 136351 600725 176797. 02887 7351 SSsS 
252500, 104615, 601528, 600345, 601890, 151390. 602170, 186790 186960 ' 

600542,186973,600423.600058,176730,186945.104760,601309,601500 300091 113705 214500 fin27^ 
600726, 176885, 164875, 139330, 104640, 147840,600065, 131242 164160 600565 llZu SSS' 2££o 
80860, 190040, 147720, 222100, 223900, 600555, 600994 186770 120420 602o£' 60 099 6037^6 S' 
32890, 600007, 158105, 603886, 603884, 176830, 162200, 602458 601603 107910 TslZl' S molll' 
114855,601498,601511, 116831, 131390, 107773, 137800. 155750, 147795 2646 00. U20 5Sf 
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126455 

147470 

186940 

603254 

601753 

163800 

603412 

601771 

125255 

600179 

147780 

601 922 

203800 

600738 

600611 

308210 

266100 

600363 

603746 

182530 

601 804 

151440 

138190 

143100 

300300 

142910 

164975 

167040 

1 73445 

164005 

600750 

600596 

159555 

159350 

602293 

600508 

142461 

181460 

138030 

182331 

126850 

602931 

120360 

263520 

426000 

114835 

113000 

108355 

186590 

1 03280 



164040 
264700 
208150 
603180 
600554 
187410 
601872 
1 54275 
131100 
142445 
601985 
600341 
600576 
180069 
601570 
308230 
309900 
600850 
603732 
123842 
153420 
602919 
138550 
145505 
156490 
602414 
302020 
1 6741 5 
601754 
182160 
1 80200 
600281 
1 47559 
147245 
194540 
143055 
602430 
600422 
600156 
210250 
126065 
602932 
603033 
603126 
116955 
603356 
112205 
600250 
186720 
1 03070 



147683,263200, 188230, 184756, 147574, 179615 
173335, 124010, 147050, 132810,230350,276000 
188040, 188400, 100600, 601564, 603875, 603849, 601497, 



602952, 602775, 602584, 602570, 601045, 600632 
600937, 600694, 311360, 302910, 600173, 300139 
1 87020, 60381 8, 1 05850, 1 07941 , 307800, 601 897 
114204, 266300,114841, 116897, 152690,603261 
60091 0, 1 8701 1 , 1 88060, 600039, 600464, 1 86355 
600837, 602772, 600835, 131530, 602695, 137580 
154276, 245480, 142800, 600998, 602354, 601698 



147796, 226200, 600276, 147830, 150200, 601937, 189906, 



100650,601662, 189965, 159440, 190020,601054 
600749, 190090, 601767, 600475, 190080, 154550 
601020,601679, 162662, 601266, 601676, 601203 
600626,600201, 600514, 191100, 195002, 600567 
1 91 1 70, 269700, 1 91 092, 1 921 30, 60031 9, 238600 
266600, 306900, 600536, 1 94070, 201 910,1 88860 
306100, 602229, 603883, 102645, 186921 , 186910 



109270, 602060, 602041, 148180, 151385, 151625, 123830, 



124030, 134660, 153370, 134850, 601023, 182280 
601 801 , 1 3931 2, 1 39392, 1 81 031 , 601 622, 1 80700 
151510,123810, 123890, 131210, 179730,602663 
277700,601295, 139311 , 153455, 176311 , 1 76310 
300079, 168461, 145981, 158070, 106700, 107400 
603492,602018, 118485, 164951, 164730, 164720 
601933, 120290, 164013, 164012, 600667, 601534 



160782, 601501, 165230, 302060, 165120, 601492, 601483, 



600714, 192968, 600696, 601458, 601447, 173325 
600481,300155, 600453, 154230, 173430, 154030 
274500, 1 7961 6, 2751 20, 1 51 990, 3001 47, 601 863 
180231, 151430, 600753, 601582, 150240, 601584 
600262, 227400, 601997, 147684, 147678, 600761 
600945, 191163, 602065, 147558, 601641 , 147450 
186855, 192020, 191510, 146631 , 146630, 180950 



602299, 145680, 602311, 145260, 600230, 602350, 239100, 



601723, 143010, 155960, 601377, 602382, 275350 
602431 , 142460, 246530, 273300, 2491 00, 602461 
300127, 600164, 600765, 602582, 138800, 300101 
601017, 137240, 602671 , 602680, 602682, 136530 
6001 5 1 , 602738 , 131340, 131241, 300008, 1 3 1 1 95 
252650, 253200, 1 88380, 60081 0, 600396, 1 23841 
601900, 600050, 185430, 185520, 121800, 254770 



600295, 603035, 262600, 603046, 600009, 263400, 120150, 



176960, 176947, 147561 , 131320 
114130, 155730, 134638, 254150 



602559, 601883, 600618, 
300065,274000, 305100, 
212895,603598, 603584, 
603130, 603109, 603061, 
121360, 122559, 123000, 
602623, 139130,249210, 
602272, 601053, 602265, 



600764, 600410, 153440, 
184745,601721,601712, 
172400,601463,600685, 
600633, 191030, 190700, 
195000,600125, 600577, 
188595,257220, 600038, 
603785, 186852, 104311, 



134797,182125,134390, 
147570,603157, 162150, 
179020,602630,601231, 
601313, 133170, 141800, 
107720,601485, 103950, 
147620, 603406,164343, 
601513,601520, 164015, 



168820, 176820,601413, 
173360, 181590, 176876, 
211750,275355, 151675, 
601153, 224500, 601154, 
147581,602014, 602024, 
602096,602113,600583, 
601693, 146255, 602260, 



600220, 602396, 304050, 
602465, 141850, 601788, 
138720, 138420, 602609, 
182090, 600807, 601826, 
600119, 602809, 602839, 
182810, 123835, 600053, 
259700,222300, 120940, 



603177,300019, 603181, 
116899, 603302,313650, 



590085, 185605, 118960, 118940, 118455, 603149 
116940, 116930, 300088, 603264, 603272, 603274 
114830, 312760, 603378, 602061, 310400, 603414, 603437, 113730, 310200, 
111400, 111100, 110750, 603518, 603526, 602136, 603576, 309845, 603594, 
307200, 603628, 107777, 602165, 603653, 600899, 107273, 107271 , 603695, 
186760,186820,187700,600916, 147183, 105590, 104610, 146770,306400, 
102670, 301000, 603851 , 600874, 600871, 102200, 158380, 603889 



601848 
300116 
603583 
603055 
253800 
1 42408 
231550 

205400 
600594 
601599 
600014 
261680 
202110 
1 07266 

601047 
601589 
601252 
142995 
108330 
119530 
1 64340 

190198 
1 76970 
190170 
601969 
147573 
602154 
238300 

600424 
602498 
600157 
602724 
602840 
217030 
120700 

118425 
266265 
113710 
308310 
603705 
602227 



107265, 
122560, 

601766, 
300022, 
603454, 
603043, 
602901 , 
602457, 
229300, 

601055, 
600739, 
164009, 
303700, 
600566, 
261000, 
107270, 

135820, 
121013, 
137295, 
602358, 
165360, 
161560, 
173515, 

600489, 
178600, 
179838, 
180440, 
601150, 
600556, 
600231, 

602421, 
140210, 
138079, 
153245, 
128240, 
602921 , 
120436, 

603207, 
603331, 
147557, 
108728, 
603717, 
103390, 



6) Glycoprotein 

438 entries found, searching for "glycoprotein" 

231200,273800, 138700, 138600, 1 73510, 173515, 192974, 109770, 138610,603578, 195002, 182889,601275, 
602977, 300051, 173470, 1 71050, 138680, 142640, 194460, 176390, 138720, 600119, 185430, 138670, 180297, 
171060, 600073, 173335, 128239, 118850,602616, 110700, 1 64345, 159465, 159460, 1 91845, 173511,600564, 
159440, 256550, 153310, 1 68810, 600738, 151460, 138470, 190197, 603586, 601325, 601193, 162820, 158120, 
155970, 190920, 603201 , 603594, 601411 , 155735, 186730, 601028, 158030, 1 76391 , 115501, 137290, 601103, 
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600315, 176870, 601525. 601418, 137061, 137207. 137060. 176397. 176393, 176394, 182888. 160995 176398 
600718, 176396, 176392. 176399, 176401, 176395, 125645. 601774. 104175, 195000, 601785; 19340o' 6OOO65' 
111200,253700,152200,154550,310200,188230.256540,120980.130660 ■ ' 

107269, 158070, 156790, 131550, 135630, 138297, 155760, 231050, 603381, 143890. 176801, 118860 103600 
193210, 186820, 114890, 188060, 118910, 107271, 153340, 313470, 600074, 603062, 109270 602724' 603356' 
120324, 192340, 266265, 110900, 306400. 601327, 118457, 151250, 136836, 253000, 104640 19001o" 19000o' 
135620, 135600, 601852, 134797, 134370, 111250, 156225,600596, 187430, 164870, 151510, 186720*601962* 
600900, 151530, 180990, 180290, 182120, 603759, 179605, 158381 , 158373, 158343, 155740 15342o' 151675* 
142800, 176300, 153240, 140100, 134820, 134660, 134390, 125240, 120520. 112205, 111740 106100* 104225* 
103000, 139090, 176860, 102720, 1 76895. 180621. 602421, 602257. 252600, 252650. 180620! 227600*228960* 
208900,182145,208400,602643,602593,602461.602171,182900 601269 ' 
600270, 600033, 602894, 247100, 1 8691 0. 600239, 230500, 21 7800, 600867, 253220, 601 581 . 305600 162060 
158378, 188450, 186880, 186990, 188040, 154582, 164022, 188540, 153634, 189965. 193190 165095* 165220* 
158375, 152790, 158374, 151627, 182205, 155750, 191316, 151523. 151020. 165330! 166490! 192225*224050* 
224100, 185490, 192240, 173445, 167770. 185050, 153619,230800, 147740, 147730. 147670! 153618*236200* 
193300,249000, 176385, 153440, 147557. 152310,253270, 184900, 182331, 146760. 146661 272300*272800* 
142930, 169615, 171640,182520, 182310,300027, 147780,305370, 151525, 142858! 14284o! 308230* 308385* 
309060, 1 42445, 1 42290, 21 6950, 1 391 30, 600026, 600044, 600046, 1 38971 , 1 3896o! 1 381 40 6001 33* 6001 82* 
600215,173350,173360,135820,600403,600410.600491,134934 147563 

600535, 133700, 600587, 133170, 131390, 600616, 600713, 131195, 600722, 130160, 600742 600751 600764 
600769, 600798, 180540, 600978, 129010, 601040, 601050, 125505, 125490, 124010, 601194 601211 ' 601253* 
147380, 120930, 601313, 120830, 120620, 601365, 601368, 120220, 601456, 601484 117700* 147045* 601652* 
274500, 116930, 116880, 601788, 116805, 114250, 113810,601880,601914, 601932, 111700 ' 602046* 602053* 
602108, 602243, 176820, 178630, 178640, 111680, 300017, 109535, 142860, 602692, 109530 602746* 308840* 
07740, 107400, 603104, 603125, 603126, 603130, 107273, 603197, 107266, 603241 , 603243! 104614* 312080* 
104610,603492.103390,102670,102480,100735,603665,603756,100730,603777 

7) Protein kinase (a member of signal transduction) 
729 entries found, searching for "protein kinase" 

600289, 160900, 601032, 176948. 600899, 176947, 176894, 601939, 602980, 601314. 601955 601335 600447 
602130, 603606, 601158, 602006, 602549, 602678, 176872, 603014, 602520, 18694s! 601254 602315* 601263* 
177061 , 176873, 600497, 603607, 602449, 602740, 602898, 602741 , 300083, 602899, 114080 603412 601184* 
602739, 1 7691 0, 600050, 1 76949, 602741 , 602743, 600341 , 601 951 , 601 21 2, 600038, 1 56490,' 600855 * 600758* 
603455, 156491 , 603424, 300075, 180381, 151510, 600168, 600085, 601890, 602896 602123 600221 ' 602904' 
602687, 176945,602897,600222,602098,601685, 176944, 109635, 601795, 601679! 603722 602521 ' 601436* 
191164, 602004, 176942, 601684, 300101, 601299, 601434, 602615, 601435, 17696o! 153390 60316e' 17909o' 
300300,176871,602216,176912,176893,603453,300185,177060.176943 ' 
176981, 603441, 176970, 116899, 603442,311550, 603495, 169190, 600448, 602399, 176946 176977 188830 
602350,601591,601782, 137026,400008,300094,600058,600831, 176892,601639,602919 1 76911 ' 176975* 
3001 72, 603072, 600583, 177015, 1 76982, 1 76980, 603248, 1 76941 , 602757, 600870. 603440 ' 148750* 601 959* 
600869, 602448, 600408, 306000, 266200, 602539, 602425, 6001 60, 31 0200, 20890o! 60099?! 1 86973* 305360* 
179611, 307030, 603262, 601050, 602758, 147795, 311800, 600664, 600441 , 603251 ,' 600527 313650* 123831 ' 

191315, 602958, 123832, 602989,602614, 123828. 136350, 601284, 602940, 601329, 600173 171834* 124095* 
603464, 603584, 116953, 300035, 600600, 109135, 601232, 601988, 603850, 603384, 603184* 602990* 125855* 
601047,603460,603258,602748,603723,164870.603368 139310 601818 

602913, 603005, 602221, 173570, 147670, 602610, 600524, 113508, 251170, 142370, 142408 191306 147796 
118423, 600286, 601074, 600765, 600922, 179610, 602747, 601953. 601132, 600136, 601288 601289* 602297* 

191316, 601535, 602925, 125450, 602839, 602168, 602461, 105590, 114085, 600714. 154235 603261 ' 109270* 
191305, 602625, 602366, 602749, 600927, 300134, 603496, 136352, 176790, 600073 601523 115441* 191170* 

21?o!« 11690 °' 188250, 1 ° 2576 ' 603764, 603497 ' 603304 ' 601934 ' 602255 . 600004, 600526,'601589*600856* 
601269, 601465, 601207. 601935, 600863, 602394, 601983, 602048, 300200, 179050. 164761 189980* 602626* 

™ 6 ° 2694 ' 602731 ' 1253 ° 5 ' 12381 °" 3001 39 ' 601 792 ' 275350 ' 171833 - 193525! 601 540! 162200* 601826* 
1 1 6898, 601 1 08, 601 528, 600954, 602609, 602838, 6031 67, 601 522 600505 

600431 , 600267, 188555, 600140, 308240, 600137, 600011, 151410, 601530, 134934, 116952 300142 191311 
601603, 603889, 601 728, 165160, 165070, 136351 , 601836, 601839, 133090, 601595. 602745* 602516* 154950* 
603601 , 602052, 154045, 603583, 600695, 602933, 602756. 601014. 602474, 602887! 172270 601366* 180220* 
602189, 114105, 600963, 603369, 601296, 603289, 602265, 602337, 600066, 60045e! 602387 600917* 139270* 
603271, 188345, 601231 , 115440, 115442, 600778, 164920. 175100,310300. 60176?! 603168 603140* 603113* 
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116940, 
601114, 
108355, 
603618, 
602527, 
272800, 
308000, 
601045, 
109690, 
258501 , 
154500, 
122560, 
603296, 
602102, 
173470, 
301300, 
600966, 
602642, 
603302, 
181031, 
125255, 
600327, 
600206, 
602288, 
103850, 
1 93245, 



1 62060.. 

300022, 

601999 

603814 

602526 

601497 

602338 

300189 

253800 

190040 

131195 

602382 

603272 

601683 

602930 

590060 

308100 

300041 

600737 

182160 

1 20090 

600799 

263200 

176761 

274270 

603729 



603048, 603275, 601146, 600434, 602336, 602310, 602308, 147522, 603311, 
164785, 167414, 123280, 600007, 600560, 229300, 151430, 176740, 602426, 
601524, 190151, 202500, 601592, 600723, 102750, 600753, 
147521 , 602514, 603015, 602525, 602524, 172471 , 602647, 602590 : 600864, 
143890, 164730,232300, 131550, 116951,601713, 154550, 603434, 138160, 
602038, 107940, 191190, 273800, 193300, 120150, 603068, 603259, 102582, 
602689, 185605, 164040, 600119, 152690, 601496, 602188, 123900, 602860, 
164860, 602505, 230800, 141850, 147545, 110300, 600051 , 138600, 233700, 
311770, 156225, 603719, 191030, 309000, 601893, 190030, 189972, 601247, 
300121, 180380,314850, 600438, 306400, 179715, 190450, 600636, 180901, 
115460, 603078, 603775, 603760, 158900, 602354, 102680, 
103320, 603597, 603510, 232600, 133170, 603353, 603345, 603318, 603309, 
114240, 147557, 232400, 139259, 602122, 230200, 164014, 248600, 601703, 
158070, 602170, 602007, 601306,248610, 601387, 161770, 132800, 164760, 
603691, 165360, 123835, 603544, 147265, 107777, 602377, 600605, 602869, 
601059, 168461, 142600,314555, 142410, 170290, 602498, 190090, 311030, 
600936, 171900, 190182, 172405, 102600, 603754, 600281 , 172430, 138249, 
602659, 192240, 603473, 136537, 603134, 182280, 603084, 125660, 189990 : 
300144, 305400, 188545, 600652, 189940, 600633, 603009, 
148500, 602384, 600650, 602190, 171150, 1 73490, 164765, 147780, 600910, 
173390, 602668, 600483, 602995, 602996, 180860, 142445, 116949, 602681, 
164757, 230450, 602730, 600782, 164880, 232700, 116935, 602303, 603203, 
300034, 601487, 600980, 600956, 601985, 600884, 601956, 191041 , 601568, 
113900,600581,516050, 190070, 164940,600544, 107941,312861,602373, 
601922, 602355, 147420, 311010, 163800, 600536, 602018, 307200, 603619, 
603752, 118888, 603796, 305600, 603852, 603853, 603886 



227645, 114110, 
109636, 602775, 



172470, 
601702, 
601099, 
164960, 
109700, 
190120, 
173335, 

230400, 
194364, 
601902, 
601121, 
142409, 
300039, 
133430, 

601272, 
603027, 
164710, 
601586, 
123836, 
603652, 



602538, 
126335, 
253700, 
600618, 
238600, 
600179, 
601290, 

139250, 
150000, 
601253, 
602388, 
190181, 
603154, 
603755, 

147880, 
182125, 
600244, 
155960, 
106165, 
193001, 



8) Calmodulin kinase (a member of signal transduction) 
35 entries found, searching for "calmodulin binding" 

300172, 114180, 302020, 139312, 602584, 602293, 600310, 603379, 114106, 602350, 114105, 114213, 138249, 
153430, 177061, 182520, 102680, 600489, 601478, 306000, 600490, 600922, 601302, 601568, 168890, 164795, 
163730, 602698, 602699, 602987, 102681 , 603384 

9) Transcription factor 

71 7 entries found, searching for "transcription factor" 

305371, 189963, 164177, 600297, 600298, 163260, 189907, 173110, 600733, 189903, 189972, 600438, 600281 , 
189962, 157670, 602272, 600609, 189968, 189971 , 189889, 107580, 189908,601748, 601542, 601632,600610, 
189969, 600480, 601760, 176310, 603022, 600519, 601750, 123803, 600520, 600494, 164176, 602460, 603246, 
113725, 107773, 189904, 602228, 147141, 132890, 189964, 164175, 142410,313650, 164011, 164014, 164005, 
601714, 600492, 164012, 601878, 600673, 600635, 191523, 602191 , 600490, 602149, 189967, 140580, 164343, 
603256, 600727, 600729, 600728, 140581 , 600571 , 601511 , 102582, 189906, 600660, 124097, 314310, 600860, 
600662, 184756, 600777, 600013, 600172, 600489, 601425, 600786, 600502, 602406, 600743, 189965, 603257, 
602751, 602542, 603148, 603107, 603789, 602318, 156845, 123811 , 600425, 

600540, 600695, 601622, 147574, 601601 , 602407, 602150, 601538, 600663, 603306, 165170, 187040, 602444, 
189902, 189973, 600659, 600661 , 60101 0, 600788, 602617, 601602, 602053, 601742, 300039, 602438, 602976, 
600744, 602543, 602479, 600481 , 600473, 603739, 600426, 603738, 189901 , 603677, 600427, 603255, 600607, 
600379, 189909, 601679, 600787, 602160, 601043, 601397, 601044, 600366, 300025, 602575, 602669, 601804, 
601 801 , 1 7631 2, 1 7631 1 , 600746, 602480, 602944, 600967, 60091 2, 306700, 306900, 1 93400, 601 206, 480000, 
191160, 601861, 164008, 600475, 600773, 600772, 600774, 142409, 156490, 600589, 601490, 151385, 600599, 
184757, 602955, 234000, 603433, 603349, 603198, 602294, 600390, 603628, 147620, 600211 , 601 787, 601863, 
147470, 603795, 603734, 152760, 104155, 128990, 601729, 600197, 147370, 

173490, 603423, 600822, 188595, 603243, 600573, 601689, 142765, 603181 , 600879, 603731 , 600288, 602295, 
121360, 164874, 300019, 162095, 602355, 603258, 126090, 159540, 300070, 600555, 600664, 601874, 153245, 
191191,601126, 601512, 146733, 131550, 142385,601796, 603406, 602959, 601734,601732, 139191 , 139139, 
600633, 138971 , 600006, 603170, 601488, 147576, 147680, 601498, 602630, 602643, 603364, 600914, 154040, 
602746, 128992, 143089, 160900, 600140, 134934, 133510, 176860, 190180, 601150, 601175, 170993, 601361, 
122560, 602778,308230, 602903, 309550, 601788, 602946, 159970, 124092, 180200, 173410, 602356,603015, 
600779, 603111 , 187930, 602614, 600951 , 603200, 602369, 164770, 147569, 603300, 603301 , 159980, 134638, 
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603431, 147730, 603366, 603348, 600556, 602136, 164160, 310200, 152390, 

601241, 116897, 137295, 600576, 194070, 601487, 600698, 164810, 601769.. 141900, 602225, 275350, 131100, 
179755, 600075, 162200, 165160, 116806, 600899, 123810, 133450, 216400. 278700, 190080* 164730* 191170 
193300,600618,600999, 601090, 106150,601843, 133530, 110700,602550, 138040. 133430. 300133' 163731* 
602302, 126337,309548, 180245, 126110, 602291 , 109565, 107400, 314670, 601444. 143100. 104760* 106180* 
601953, 600584, 125852, 602419, 600401 , 142200, 107680, 167414, 600020, 188400, 208900 175100 602700* 
601828, 139320,602777, 600185,602681,603023,314997,602848, 600284, 102578, 114290^ 165095* 137070* 
602991 , 602421 , 600005, 602996, 314995, 152200, 151900, 112260, 129010, 600892. 273800 1 76760* 602341 ' 
490000, 136533, 400003, 601007, 602229, 603620, 602218, 602116, 602020, 

142000, 601955, 126340, 120150, 193067, 182452, 142461 , 194558, 180660, 600756. 160745. 107741 106210 
157640, 186770, 146738, 603759, 213700, 147880, 152391,277900, 151626, 107730, 600711 600246' 107470* 
600237, 223100, 107748, 600065, 600349, 426000, 194500, 154030, 227650, 600247. 314980^ 109560 305370* 
600800, 301000,277700,600838,312173,600439, 600440, 191315, 601595, 190450. 190070. 190020* 162360* 
131320, 133540,600993,601993,159530, 601902, 602868, 181590, 601724, 602260. 601093^ 187270*164761* 
602102, 603245, 136950, 106100, 601182, 167410, 601897, 602896, 170100, 602506. 104150 1 76730* 601600* 
187011, 102600, 180380, 162080, 603450, 142967, 602301, 126375, 603372, 603355. 164720 603250 167409* 
167415, 602897, 601565, 185250, 182138, 601851 , 600749, 601575. 194548, 

154500, 601365, 194541, 601621, 601623, 601531 , 600790, 194355, 123830, 123812. 154540. 601415, 143055 
601386, 194550, 186930, 131290, 601320, 601620, 601754, 601313, 184430, 182900. 182500^ 600725* 147870* 
154365, 116953, 601297, 601296, 601265, 600796, 120436, 601644, 601930, 601643. 230200 601645* 601972* 
600861,602009,601172, 601158, 601646, 180630, 600821, 118440, 601656,601647. 150200* 601125 601671 ' 
141850, 116899, 600697, 109270,202110, 150570, 601108, 191339,601063, 109691, 180240,203100*151430* 
179710, 111000, 176797, 238600, 104311,240300, 125255, 600423, 158070, 602439. 600324* 112261 243305* 
602474, 174762, 600613, 602539, 138890, 138720, 114550, 173865, 602582, 602584^ 173510*600250 602627* 
1 73325, 602635, 246530, 1 72425, 6001 93, 602691 , 6001 88, 1 70998. 152790, 

168468, 256540, 225250, 600848, 143400, 168461 , 262600, 168360, 601912, 602951 . 600017 230000 266600 
602981, 272800, 109150, 102200, 603025, 603026, 603109, 167050, 603127, 603128. 165240. 230400,' 31 3700 ' 
164975, 164875, 602017, 115500, 235800, 164873, 602110, 164785, 1 64772, 312865, 603296 600542* 164740* 
602125, 309801 , 602148, 300007, 306955, 603368, 116940, 602181 , 603416, 126650. 163920 300024* 603437* 
602209, 603576, 603607,305435, 600944, 180410, 303630, 159557, 301870, 132810. 100790 603849 603862' 
603881, 



[0074] There are several methods for analyzing the expression levels of genes associated with diseases. Differences 
in gene expression levels between diseased and normal tissues are studied by the analytical methods, for example 
Northern hybridization and differential display. Other examples include a method with high-density cDNA filter, a method 
with DNA microarray and methods with PCR amplification (Experimental Medicine, Vol.17, No. 8, 980-1056 (1999)- 
Cell Engineering (additional volume) DNA Microarray and Advanced PCR Methods, Muramatsu &Naba (eds.). Shu' 
junsya). The levels of gene expression between diseased tissues and normal tissues can be studied by any of these 
analytical methods. When explicit difference in expression level is observed for a gene, it can be concluded that the 
gene is closely associated with a disease or disorder Instead of diseased tissues, cultured cells can be used for the 
assessment. Similarly, when gene expression is explicitly different between normal cells and cells reproducing disease- 
associated specific features, it can be concluded that the gene is closely associated with a disease or disorder When 
the expression levels of genes are evidently varied during major cellular events (such as differentiation and apoptosis) 
the genes are involved in the cellular events and accordingly are candidates for disease- and/or disorder-associated 
genes. Further, genes exhibiting tissue-specific expression are genes playing important parts in the tissue functions 
and, therefore, can be candidates for genes associated with diseases and/or disorders affecting the tissues. 
[0075] For example, non-enzymic protein glycation reaction is believed to be a cause for a variety of chronic diabetic 
complications. Accordingly, in endothelial cells, genes, of which expression levels are elevated or decreased in a gly- 
cated protein-dependent manner, are associated with diabetic complications caused by glycated proteins (Diabetes 
1 996, 45 (Suppl. 3), S67-S72; Diabetes 1 997, 46 (Suppl. 2), S1 9-S25). The onset of rheumatoid arthritis is thought to 
be involved in the proliferation of synovial cells covering inner surfaces of joint cavity and in inflammatory reaction 
resulted from the action of cytokines produced by leukocytes infiltrating into the joint synovial tissues (Rheumatism 
Information Center, http://www.rheuma-net.or.jp/). Recent studies have also revealed that tissue necrosis factor (TNF)- 
<x participates in the onset (Current opinion in immunology 1 959, 1 1 , 657-662). When the expression of a gene exhibits 
responsiveness to the action of TNF on synovial cells, the gene is considered to be involved in rheumatoid arthritis 
Many genes acting at the downstream of TNF-. and IL-1. among inflammation-associated cytokines have been previ- 
ously identified. The respective stimulations are transduced through independent pathways of signaling cascade. There 
exists another signaling cascade for both stimulations, wherein NF-.B is a common transducing molecule shared by 
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the two stimulations (J. Leukoc. Biol., 1994, 56(5): 542-547). It has also been revealed that many inflammation-asso- 
ciated genes, including IL-2, IL-6 and G-CSF, are varied in the expression levels in response to the signal through the 
common pathway (Trends Genet. 1999, 15(6): 229-235). It is assumed that genes of which expression levels are varied 
in response to the stimulation of TNF-. or IL-1. also participate in inflammation. Genes associated with neural differ- 
5 entiation can be candidates for causative genes for neurological diseases as well as candidates for genes usable for 
treating the diseases. 

[0076] Clones exhibiting differences in the expression levels thereof can be selected by using gene expression anal- 
ysis. The selection comprises, for example; analyzing cDNA clones by using high-density cDNA filter; and statistically 
treating the multiple signal values (signal values of radioisotope in the radiolabeled probes or values obtained by meas- 

10 uring fluorescence intensities emitted from the fluorescent labels) for the respective clones by two-sample Hest, where 
the signal values are determined by multiple experiments of hybridization. The clones of interest are selectable based 
on the statistically significant differences in the signal distribution at p<0.05. However, selectable clones with significant 
difference in the expression levels thereof may be changed depending on the partial modification of statistical treatment. 
For example, the clones may be selected by conducting statistical treatment with two-sample t-test at p<0.01 ; or genes 

15 exhibiting more explicit differences in the expression levels thereof can be selected by performing statistical treatment 
with a pre-determined cut-off value for the significant signal difference. An alternative method is that the expression 
levels are simply compared with each other, and then, the clones of interest are selected based on the ratio of the 
expression levels thereof. 

[0077] Clones exhibiting differences in the expression levels can also be selected by comparing the expression levels 
20 by PCR analysis, for example, by using the method of determining the band intensities representing the amounts of 
PCR products with ethidium bromide staining; the method of determining the radioisotopic signal values orf luorescence 
intensities of the PCR products when radio-labeled or fluorescence-labeled primers; or the method of determining the 
values of radioisotope signals or fluorescence intensities of the probes hybridized to the PCR products when radio- 
labeled or fluorescence-labeled probes, respectively, are used in the hybridization. If the expression level ratios ob- 
25 tained in multiple PCR experiments are constantly at least 2-fold, such a clone can be judged to exhibit the difference 
in the expression level. When the ratios are several-fold or not less than 10-fold : the clone can be selected as a gene 
exhibiting the explicit difference in the expression level. 

[0078] A survey of genes of which expression levels are varied specifically to the glycated protein in the endothelial 
cells revealed three genes with elevated expression levels, NT2RP2001538, NT2RP4001001 and Y79AA1 000967. 
30 These clones are genes associated with diabetes. 

[0079] A survey of genes of which expression levels are varied in response to TNF (Tumor Necrosis Factor-alpha) 
in the primary cell culture of synovial tissue detected the following clones with elevated expression levels in the presence 
of TNF: 

BNGH41000020, HEMBA1 000349, HEMBA1000634, HE MBA 1000671 , H EM BA1 000835, HEMBA1000962, 

35 HEMBA1002178, HEMBA1002195, HEMBA1002239, HEMBA1002420, HEMBA1002524, HEMBA1002992, 
HEMBA1003315, HEMBA1003392, HEMBA1003487, HEMBA1003602, HEMBA1004067, HEMBA1004797, 
HEMBA1005337, HEMBA1005489, HEM3A1006916, HEMBB1000668, HEMBB1 000905, HEMBB1001547, 
HEMBB1001573, HEMBB1002041 , HEMBB1002663, MAMMA1000652, MAMMA1000810, MAMMA1001634, 
MAMMA1002091, MAM MA1 002234, NT2RM2000306, NT2RM4000417, NT2RP1000002, NT2RP10001 81 , 

40 NT2RP1 000740, NT2RP2000694, NT2RP2001921 , NT2RP2002527, NT2RP2004495, NT2RP2004606, 
NT2RP2005163, NT2RP2005463, NT2RP2006134, NT2RP3000171 , NT2RP3000652, NT2RP3001195, 
NT2RP3001 976, NT2RP3003473, NT2RP3003874, NT2RP3004090, NT2RP3004294, NT2RP3004557, 
NT2RP3004647, NT2RP4000108, NT2RP4001001 , NT2RP4001 877, OVARC1 000090, OVARC1000105, 
OVARC1000275, OVARC1000439, OVARC1001607, PLACE 1000740, PLACE1000927, PLACE1001016, 

45 PLACE1001100, PLACE1001464, PLACE1001500, PLACE1001918, PLACE1002095, PLACE1002547, 
PLACE1 003644, PLACE1004519, PLACE1005031 , PLACE1005410, PLACE1005736, PLACE1006219, 
PLACE1 006809, PLACE1008716, PLACE101 0081 , THYRO1001770, Y79AA1000127, Y79M1 000207, 
Y79AA1 000270, Y79AA1 000876, Y794A1001013, Y79AA1001264, Y79AA1001272, Y79AA1001328, 
Y79AA1 001430, Y79AA1001530, Y79AA1001799 

so [0080] Clones with decreased expression levels in the presence of TNFD are NT2RM4000326, NT2RP1000300, 
NT2RP2000514, NT2RP2001 755, NT2RP2006042, NT2RP3000481 , NT2RP3002790. These clones are candidates 
for rheumatoid arthritis-associated genes. 

[0081] A survey of genes of which expression levels are varied in response to TNF. (Tumor Necrosis Factor-alpha) 
or IL-1 . (lnterleukin-1 beta) in a human T cell strain, Jurkat cell, revealed the following clones with elevated expression 
55 levels in the presence of TNF: 

MAMMA1 0001 41 , MAMMA1 000788, MAMMA1 001237, MAMMA1 002070,NT2RM2000582, NT2RM20021 09, 
NT2RP1 000679, NT2RP2003664, NT2RP2004108, NT2RP2005597, NT2RP3001592, NT2RP3002738, 
NT2RP3004133, NT2RP3004294, NT2RP3004321 , NT2RP3004557, PLACE1002547, PLACE1003573, 
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^MIOoS PLACE1 ° 08744 PLACE1010011, PUCE1010713, PLACE1011181, Y79AA1000776, 

Sa,io A n S /Zn, 0 , f9eneS ° f WhiCh eXPreSSi ° n ' eVe,S 3re Varied in res P° nse t0 ,he ^"'ation for inducing cell dif- 

^SS^SZ^TT 6 ^^ ° f USin9 RA/inWbit0r < inhiM ° rf °'<* » -M-onJ, in cultured^ L of 
uc»«f/« revealed ,he following clones with elevated expression levels in the presence of RA 

II m = = SH E5 

is §§ lip == ss s= 



NT2RM4000198, 
NT2RP1000181, 
NT2RP2000092, 
NT2RP2001388, 
NT2RP2002824, 
NT2RP2003383, 



NT2RM4000284 f 
NT2RP1 000261, 
NT2RP2000479, 
NT2RP2001538, 
NT2RP2002942, 
NT2RP2003469, 



NT2RM4000593, 
NT2RP1 000465, 
NT2RP2000610, 
NT2RP2001878, 
NT2RP2002974, 
NT2RP2003664, 



NT2RM4001321, 
NT2RP 1000468, 
NT2RP2000663, 
NT2RP2002015, 
NT2RP2002976, 
NT2RP2003940, 



NT2RP2004524, NT2RP2004556, NT2RP2004670, NT2RP2005069, 



NT2RP2005247, 
NT2RP2005774, 
NT2RP3000011, 
NT2RP3000652, 
NT2RP3001170 : 
NT2RP3001976, 
NT2RP3003532, 
NT2RP3004090, 
NT2RP4000634, 



NT2RP2005425 
NT2RP2005878 
NT2RP3000125, 
NT2RP3000677, 
NT2RP3001240, 
NT2RP3002790, 
M2RP3003729, 
NT2RP3004130, 
NT2RP4002451 



OVARC1 000553, OVARC1 000775 
O VA RC 1 00 1 030 , O VARC 1 00 1 049 



PLACE1 000442, 
PLACE1002518, 
PLACE1 003839, 
PLACE1 005736, 
PLACE1 007626, 



PLACE 1000927, 
PLACE1002547, 
PLACE 1004078, 
PLACE 1005768, 
PLACE1 008282, 



NT2RP2005463, 
NT2RP2005883, 
NT2RP3000171, 
NT2RP3000818, 
NT2RP3001322, 
NT2RP3002900, 
NT2RP3003874, 
NT2RP3004202, 
NT2RP4002715, 
OVARC 1000853, 
OVARC1001132, 
PLACE1 000986, 
PLACE1 002967, 
PLACE1 004441, 
PLACE1 005815, 



NT2RP2005514, 
NT2RP2005887, 
NT2RP3000232, 
NT2RP3000820, 
NT2RP3001388, 
NT2RP3002983, 
NT2RP3003939, 
NT2RP3004294, 
OVARC1 000090, 
OVARC1 000873, 
OVARC1 001596, 
PLACE1001100, 
PLACE 1003407, 
PLACE 1004450, 
PLACE1 006073, 
PLACE1 008985, 



NT2RP1 000002, 
NT2RP1 000579, 
NT2RP2000694, 
NT2RP2002304, 
NT2RP2003179, 
NT2RP2004069, 

NT2RP2005535, 
NT2RP2006099, 
NT2RP3000460, 
NT2RP3001044, 
NT2RP3001542, 
NT2RP3003000, 
NT2RP3004025, 
NT2RP3004640, 
OVARC1 000208, 



NT2RP1 000050, 
NT2RP1 000679, 
NT2RP2000903, 
NT2RP2002721 , 
NT2RP2003302, 
NT2RP2004108, 

NT2RP2005541, 
NT2RP2006134, 
NT2RP3000481, 
NT2RP3001061, 
NT2RP3001592, 
NT2RP3003354, 
NT2RP3004083, 
NT2RP4000108, 
OVARC1 000275, 



OVARC1 00091 6, OVARC1 000995, 
OVARC 1 0021 78, PLACE1 000256, 



PLACE 1001 123, 
PLACE1 003428, 
PLACE1 005669, 
PLACE1 006208, 
PLACE1 010445, 



PLACE1001795, 
PLACE1 003644, 
PLACE1 005682, 
PLACE1 007296, 
PLACE1011708, 



THYRO1001205, 
Y79AA1 000226, 
Y79AA1002213, 



. PLACE1008984, , 
PLACE1011978, PLACE4000455, SKNMC1000004, THYRO1000036 

SZS' IZnlnS THYR01000 " 9 ' THYRO1001063, THYRO1001128, 
IHYRO1001327 : THYRO1001523, THYRO1001725, THYRO1001770 Y7QAAmnn9H7 

VrlToS Y79AA1001056 ' V79AA1001062, Y79AA100™90 7 °' 

[0084] The purvey also revealed the clones of which expression levels were decreased in the presence of RA Th. 

ar ^ NGH41 ° 0002 ?' HEMBA1005070, NT2RP2005027. NT2RP3003473 and Y79M 002376 
Sor 6 SUrVeV fUrther reVea ' ed ,0 "° Win9 Cl ° neS With etevate ° «P«*n levels in the presence of RA/ 

== SS =5 = ==: = 

NT2RM2000582, NT2RP1001004, NT2RP2000663, NT2RP200069i NT2RP20009n*' 

NT2RP2002674, NT2RP2002974, NT2RP2003383, NfT2RP2004 06 9 nSrSSob 

NT2RP2005069, NT2RP2005425. NT2RP2005463 fWoOosS' NtIrP^obS' 

NT2RP3000460, NT2RP3000838, NT2RP3001044 NT2RP300 2^ 88 



MAMMA1001344, 
NT2RP2001388, 
NT2RP2004837, 
NT2RP2005887, 
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NT2RP3002721, NT2RP3002738, NT2RP3003469, NT2RP3004083, NT2RP3004130, NT2RP3004202, 
NT2RP3004294, NT2RP3004640, NT2RP4000108, NT2RP4002451 , NT2RP4002715, OVARC1 000275, 
OVARC1 000467, OVARC1 000553, OVARC1 000853, OVARC1 000873, OVARC1000916, OVARC1 000995, 
OVARC1001030, OVARC1 001222, OVARC1 001596, OVA RC 1002058, OVARC1 002178, PLACE1 000927, 

5 PLACE1001123, PLACE1001407, PLACE1001464, PLACE1001564, PLACE1001795, PLACE1002547, 
PLACE1003407, PLACE1003644, PLACE1003845, PLACE1004441 , PLACE1004482, PLACE1005410, 
PLACE1 005601, PLACE1 005725, PLACE1 005736, PLACE 1006093, PLACE1006219, PLACE1 006290, 
PLACE1006716, PLACE1007296, PLACE1007626, PLACE1008359, PLACE1010968, PLACE1011364, 
PLACE1011824, THYRO1000678, THYRO1000776, THYRO1000999, THYRO1001113, THYRO1001237, 

10 THYRO1001523 : Y79AA1 000226, Y79AA1 000888, Y79AA1001430 

[0086] The same survey further revealed the following clones with elevated expression levels in the presence of RA/ 
inhibitor: 

HEMBA1000349, HEMBA1001297 ! HEMBA1001878, HEMBA1005070, HEMBA1006482, HEMBB1001 959, 
NT2RM2001939, NT2RP1000981 , NT2RP2001469, NT2RP3003473, OVARC1001132, PLACE1001655, 
15 Y79AA1000127, Y79AA1 002381 . These clones are associated with neural differentiation and, therefore, are candi- 
dates for genes associated with neurological diseases. 

[0087] Based on the functional analyses using a secretory protein, membrane protein, or proteins associated with 
signal transduction, glycoprotein, transcription, or diseases, it is possible to develop a medicine. 
[0088] In case of a membrane protein, it is most likely to be a protein that functions as a receptor or ligand on the 
20 cell surface. Therefore, it is possible to reveal a new relationship between a ligand and receptor by screening the 
membrane protein of the invention based on the binding activity with the known ligand or receptor. Screening can be 
performed according to the known methods. 

[0089] For example, a ligand against the protein of the invention can be screened in the following manner. Namely, 
a ligand that binds to a specific protein can be screened by a method comprising the steps of: (a) contacting a test 
25 sample with the protein of the invention or a partial peptide thereof, or cells expressing these, and (b) selecting a test 
sample that binds to said protein, said partial peptide, or said cells. 

[0090] On the other hand, for example, screening using cells expressing the protein of the present invention that is 
a receptor protein can also be performed as follows. It is possible to screen receptors that is capable of binding to a 
specific protein by using procedures (a) attaching the sample cells to the protein of the invention or its partial peptide, 

30 and (b) selecting cells that can bind to the said protein or its partial peptide. 

[0091] In a following screening as an example, first the protein of the invention is expressed, and the recombinant 
protein is purified. Next, the purified protein is labeled, binding assay is performed using a various cell lines or primary 
cultured cells, and cells that are expressing a receptor are selected (Growth and differentiation factors and their re- 
ceptors, Shin-Seikagaku Jikken Kouza Vol.7 (1991) Honjyo, Arai, Taniguchi, and Muramatsu edit, p203-236 : Tokyo- 

35 Kagaku-Doujin). A protein of the invention can be labeled with Rl such as 125 l, and enzyme (alkaline phosphatase 
etc.). Alternatively, a protein of the invention may be used without labeling and then detected by using a labeled antibody 
against the protein. The cells that are selected by the above screening methods, which express a receptor of the protein 
of the invention, can be used for the further screening of an agonists or antagonists of the said receptor. 
[0092] Once the ligand binding to the protein of the invention, the receptor of the protein of the invention or the cells 
expressing the receptor are obtained by screening, it is possible to screen a compound that binds to the ligand and 
receptor. Also it is possible to screen a compound that can inhibit both bindings (agonists or antagonists of the receptor, 
for example) by utilizing the binding activities. 

[0093] When the protein of the invention is a receptor, the screening method comprises the steps of (a) contacting 
the protein of the invention or cells expressing the protein of the invention with the ligand, in the presence of a test 

45 sample, (b) detecting the binding activity between said protein or cells expressing said protein and the ligand, and (c) 
selecting a compound that reduces said binding activity when compared to the activity in the absence of the test sample. 
Furthermore, when the protein of the invention is a ligand, the screening method comprises the steps of (a) contacting 
the protein of the invention with its receptor or cells expressing the receptor in the presence of samples, (b) detecting 
the binding activity between the protein and its receptor or the cells expressing the receptor, and (c) selecting a com- 

50 pound that can potentially reduce the binding activity compared to the activity in the absence of the sample. 

[0094] Samples to screen include cell extracts, expressed products from a gene library, synthesized low molecular 
compound, synthesized peptide, and natural compounds, for example, but are not construed to be listed here. A com- 
pound that is isolated by the above screening using a binding activity of the protein of the invention can also be used 
as a sample. 

55 [0095] A compound isolated by the screening may be a candidate to be an agonist or an antagonist of the receptor 
of the protein. By utilizing an assay that monitors a change in the intracellular signaling such as phosphorylation which 
results from reduction of the binding between the protein and its receptor, it is possible to identify whether the obtained 
compound is an agonist or antagonist of the receptor. Also, the compound may be a candidate of a molecule that can 
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can ^ TT I" Pr0 ' e,n i,S associated P rotei "s (including a receptor) in vivo. Such compounds 

rooofit T m9S <0r PreCaU,i ° n ° r CUreS °' a disease with which ,he b associated 

[0096] Secretory proteins may regulate cellular conditions such as growth and differentiation It is possible to find 

ce ZolT ' regU,a,eS Ce " Ular C ° nditi0nS by addi " 9 the Secre,0f > P rotein ° f ■» invention ,o a certaL md of 
ceMnd perform.ng a screen,ng by utilizing the ce.lular changes in growth or differentiation, or aeration of a pamcu.ar 

[0097] The screening can be performed, for example, as follows. First, the protein of the invention is expressed and 

r T m ^ inan L ,0,^, • ThGn ' ' he PUri,ied Pr ° ,ein iS --oinjectedadded into a variou Id oSHnes or 
pnma^ cultured cells, and the change in the cel. growth and differentiation is monitored. The induction ofTpamcula 
gene that is known to be involved in a certain cellular change is detected with the amounts of mRNA and p^ote n 
Alternatively, the amount of an intracellular molecule (low molecular compounds, etc ) that is chanced bv the Eon 

it ore d ; c i (protein) that is k r n to be functionin9 in a certain iiar ^ ^wssr^ 

[0098] Once the screening reveals that the protein of the invention can regulate cellular conditions or the functions 

a. erinTa Zn^T* ^ *' ? pha(maceutical and 'or associated disease y i«se tf 0r b y 

aitenng a part of it into an appropriate composition. 

[0099] As is above described for membrane proteins, the secretory protein provided by the invention mav be used 
to explore a novel hgand-receptor interaction using a screening based on the binding activity to a L^iqand ^ 

me hods can be a candidate of a compound that can inhibit the interaction between the protein of the invention and 
an interacting molecule (including a receptor). The compounds may be able to use as a preventive therap^utTc Ind 
diagnostic medicine for the diseases, in which the protein may play a certain role. therapeutic, and 

[0100] Proteins associated with signal transduction or transcription may be a factor that affects a certain orotein or 
gene in response to intracel.ular/ex.racellu.ar stimuli. „ is possible to find out a novel facto r 2Sn SSl S J 

toe tTTT 9 ^ Pr ° ,ein Pr ° Vided by ,he inVemi ° n in 3 Certain ^ es of cells * and Perming ^screening Xing 
the activation of a certain intracellular protein or gene. 9 """zing 

[0101] The screening may be performed as follows. First, a transformed cell expressing the protein is obtained Then 
the transf ormed cell line and the untransformed original cell are compared for the changes in thTexprision of a cl^ain 
gene by detecting the amount of its mRNA or protein. Alternatively, the amount of a intraceLTmo^ule Z mo 
lecular compounds), which is changed by the function of a gene product (protein) that is k^w" t funS in a ceZ" 

Z V d f Cti0n ' FUrtherm ° re ' ^ Ch8n9a ° f Session * I ceSn g ne can be 

detected by introducing a fusion gene that comprises a regulatory region of the gene and a marker gene (luciferase 

ssssr c. sissr n9 the protein provided by the inveLn imo me ce "- and ei ~ 

En rln h ,r fl r°, tein " °' *° '""^ " aSS ° Ciated With diseases ' " is P ossible >° s °reen a gene or compound 

[0103] For example, the prote.n of the invention is expressed and the recombinant protein is purified Then the 
protein and gene whose expression was affected in the presence of the protein of the inventio n . a so purifS and 
the binding activity between the two proteins or genes k examined. The examination may 

mart with a compound that is candidate of an inhibitor. In an alternative method, a iLJ^o^Z^Sn 
locating ,n the 5 -upstream of the gene encoding the protein of the invention that is capable of regulating'he express on 
of other genes ,s obtained, and fused with a marker gene. The fusion is introduced into a celf and the ceTfs a^de! 
wrth compounds to explore a regulatory factor of the expression of the said gene 

c?nTL!h h o e H° mP0Und ? tained by the SCrSenin9 Can be Used ,or d eveloping pharmaceutical and diagnostic medi- 
.Lt! h ? . k 63568 WIth Wh ' Ch the Pr ° tein 0i ,he present invention is associated. Similarly, if the reguSv factor 

as a pharmaceutical, and If there is a comnnnnd 
ofoSinhe nrof ma * XpreSS '° n ' eVel and/or «' *e protein, it also can be used for the same purpose 

EEL P ' nVen, ' 0n " aS a " enZymatiC ac,ivtty ' re 9 ard,ess of whethe r * * a secretory proL mem- 

ZoIoT* H f P HH emS aSS0Cia,6d Wtth Si9 " al transducti0 "- glycoprotein, transcription, or diseases, a scrSng Z 
!h7„n I Ttf " 9 f COmP ° Und ,0 the Pr ° tein ° f the inven,i ° n under Citable condition and monZnqtte 
aSoTthe^n° Und - ^ ^ *° * U °' iZed t0 *" a C - po - d «" a ^ « £ 

Sd lien' ™" 9 . 9 J Ven 38 "I eX T' e ' Pr0,ei " °' ,hS inVemi ° n fe eX P ressed and the recombinant protein is 
nrnS compounds are contacted w.th the purified protein, and the amount of the compound and the reaction 

SSe ZZ' ^ Tr ndS th3t are Candida,eS °' a " inhib « or are P~. *H a con^und 
is examinid PlJn, ' ed P ™ Wn " addCd ' ^ ,he am ° Unt °' the substrate and »• reaction products 

[0107] The compounds obtained in the screening may be used as a medicine for diseases with which the orotein of 
me ,nvent,on * associated. Also they can be app^ 
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normally in vivo. 

[0108] Whether the secretory or membrane protein of the present invention is a novel protein associated with dis- 
eases or not is determined in another method than described above, by obtaining a specific antibody against the protein 
of the invention, and examining the relationship between the expression or activity of the protein and a certain disease. 
5 In an alternative way, it may be analyzed referred to the methods in "Molecular Diagnosis of Genetic Diseases" (Elles 
R. edit, (1996) in the series of "Method in Molecular Biology" (Humana Press). 

[0109] Disease-associated proteins are a target of the above described screenings and very useful for developing 
a drug that is capable of regulating the expression and activity of the protein. Also, they are useful in medicinal industry 
as a diagnostic marker of the related disease and as a target for gene therapy. 

10 [0110] Compounds isolated as mentioned above can be administered patients as it is, or after formulated into a 
pharmaceutical composition according to the known methods. For example, a pharmaceutically acceptable carrier or 
vehicle, specifically sterilized water, saline, plant oil, emulsifier, or suspending agent can be mixed with the compounds 
appropriately. The pharmaceutical compositions can be administered to patients by a method known to those skilled 
in the art, such as intraarterial intravenous, or subcutaneous injections. The dosage may vary depending on the weight 

15 or age of a patient, or the method of administration, but those skilled in the art can choose an appropriate dosage 
properly. If the compound is encoded by DNA, the DNA can be cloned into a vector for gene therapy, and used for 
gene therapy. The dosage of the DNA and the method of its administration may vary depending on the weight or age 
of a patient, or the symptoms, but those skilled in the art can choose properly. 

[01 1 1] The protein encoded by the polynucleotide of the invention can be prepared as a recombinant protein or as 

20 a natural protein. For example, the recombinant protein can be prepared by inserting the polynucleotide encoding the 
protein of the invention into a vector, introducing the vector into an appropriate host cell and purifying the protein 
expressed within the transformed host cell, as described below. In contrast, the natural protein can be prepared, for 
example, by utilizing an affinity column to which an antibody against the protein of the invention (Current Protocols in 
Molecular Biology (1987) Ausubel et al. edit, John Wily & Sons, Section 16.1-16.19) is attached. The antibody used 

25 for the affinity purification may be either a polyclonal antibody, or a monoclonal antibody. Alternatively, in vitro translation 
(See : for example, "On the fidelity of mRNA translation in the nuclease-treated rabbit reticulocyte lysate system." Dasso 
M.C., and Jackson R.J. (1989) Nucleic Acids Res. 1 7: 3129-3144) may be used for preparing the protein of the invention. 
[01 12] Proteins functionally equivalent to the proteins of the present invention can be prepared based on the activities, 
which were clarified in the above-mentioned manner, of the proteins of the present invention. Using the biological 

30 activity possessed by the protein of the invention as an index, it is possible to verify whether or not a particular protein 
is functionally equivalent to the protein of the invention by examining whether or not the protein has said activity. 
[0113] Proteins functionally equivalent to the proteins of the present invention can be prepared by those skilled in 
the art, for example, by using a method for introducing mutations into an amino acid sequence of a protein (for example, 
site-directed mutagenesis (Current Protocols in Molecular Biology, edit, Ausubel et al., (1987) John Wiley & Sons, 

35 Section 8.1 -8.5). Besides, such proteins can be generated by spontaneous mutations. The present invention comprises 
the proteins having one or more amino acid substitutions, deletions, insertions and/or additions in the amino acid 
sequences of the proteins of the present invention (Table 370), as far as the proteins have the equivalent functions to 
those of the proteins identified in the present Examples described later. 

[0114] There are no limitations in the number and sites of amino acid mutations, as far as the proteins maintain the 
40 functions thereof. The number of mutations is typically 30% or less, or 20% or less, or 10% or less, preferably within 
5% or less, or 3% or less of the total amino acids, more preferably within 2% or less or 1% or less of the total amino 
acids. From the viewpoint of maintaining the protein function, it is preferable that a substituted amino has a similar 
property to that of the original amino acid. For example, Ala, Val, Leu, lie, Pro, Met, Phe and Trp are assumed to have 
similar properties to one another because they are all classified into a group of non-polar amino acids. Similarly, sub- 
45 stitution can be performed among non-charged amino acid such as Gly, Ser. Thr, Cys, Tyr, Asn, and Gin. acidic amino 
acids such as Asp and Glu, and basic amino acids such as Lys, Arg, and His. 

[0115] In addition, proteins functionally equivalent to the proteins of the present invention can be isolated by using 
techniques of hybridization or gene amplification known to those skilled in the art. Specifically, using the hybridization 
technique (Current Protocols in Molecular Biology, edit, Ausubel et al., (1987) John Wiley & Sons, Section 6.3-6.4)), 

50 those skilled in the art can usually isolate a DNA highly homologous to the DNA encoding the protein identified in the 
present Example based on the identified nucleotide sequence (Table 370) or a portion thereof and obtain the functionally 
equivalent protein from the isolated DNA. The present invention includes proteins encoded by the DNAs hybridizing 
with the DNAs encoding the proteins identified in the present Example, as far as the proteins are functionally equivalent 
to the proteins identified in the present Example. Organisms from which the functionally equivalent proteins are isolated 

55 are illustrated by vertebrates such as human, mouse, rat, rabbit, pig and bovine, but are not limited to these animals. 
[0116] Washing conditions of hybridization for the isolation of DNAs encoding the functionally equivalent proteins 
are usually "1xSSC, 0.1% SDS, 37"; more stringent conditions are "0.5xSSC, 0.1% SDS, 42."; and still more stringent 
conditions are "0.1 x SSC, 0.1% SDS, 65.". Alternatively, the following conditions can be given as hybridization con- 
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ditions of the present invention. Namely, conditions in which the hybridization is done at "6xSSC 40% Formamide 
25 L, and the washing at "IxSSC, 55." can be given. More preferable conditions are those in which the hybridization 
It Tl 3 ! ' 6xSSC ' 40% F o""amide, 37.", and the washing at "0.2xSSC. 55.". Even more preferable are those in which 
the hybridization is done at "6xSSC, 50% Formamide, 37 ", and the washing at "O.lxSSC. 62.". The more stringent 
the condmons of hybridization are, the more frequently the DNAs highly homologous to the probe sequence are isolated 
Therefore, it is preferable to conduct hybridization under stringent conditions. Examples of stringent conditions in the 
m ^ nt L 0n are ". washin 9 conditions of "°- 5 *SSC, 0.1% SDS, 42.", or alternatively, hybridization conditions of 
C c™f . ' 37 "' 3nd ,he W3Shing at ° 0 2xSSC . 55 "- However, the above-mentioned combinations of 

SSC, SDS and temperature conditions are indicated just as examples. Those skilled in the art can select the hybridi- 
zation conditions with similar stringency to those mentioned above by property combining the above-mentioned or 
other factors (for example, probe concentration, probe length and duration of hybridization reaction) that determines 
the stnngency of hybridization. 

[0117] The amino acid sequences of proteins isolated by using the hybridization techniques usually exhibit high 
homology to those of the proteins of the present invention, which are shown in Table 370. The present invention en- 
compasses a polynucleotide comprising a nucleotide sequence that has a high identity to the nucleotide sequence of 
claim 8 (a) Furthermore, the present invention encompasses a peptide, or protein comprising an amino acid sequence 
that has a high identrty to the amino acid sequence encoded by the polynucleotide of claim 8 (b). The term "high identity- 
indicates sequence identity of at least 40% or more; preferably 60% or more; and more preferably 70% or more 
Alternatively, more preferable is identity of 90% or more, or 93% or more, or 95% or more, furthermore. 97% or more' 
or 99 A or more. The identity can be determined by using the BLAST search algorithm 

, ^„ the 96ne ampli,ica,ion technique (PCR) (Current Protocols in Molecular Biology, edit, Ausubel et al 
(1987) John Wiley & Sons, Section 6.3-6.4)) using primers designed based on the nucleotide sequence (Table 370) 
or a portion thereof identified in the present Example, it is possible to isolate a DNA fragment highly homologous to 
the nucleotide sequence or a portion thereof and to obtain functionally equivalent protein to a particular protein identified 
in the present Example based on the isolated DNA fragment. 

l VJ 9 l ""!! ^ e ": e^, id6ntity " °' tW ° amin ° acid seauences ° r of tw> nucleic acids is determined using the algorithm 
of Kariin and Attschul (Proc. Natl. Acad. Sei. USA 87:2264-2268, 1 990), modified as in Karlin and Altschul (Proc Natl 
Acad Se,. USA 90 :5873-5877, 1993). Such an algorithm is incorporated into the BLASTN and BLASTX programs of 
Altschul et al. (J. Mol. Biol.21 5:403-41 0, 1 990). BLAST nucleotide searches are performed with the BLASTN program 
score = 100 wordlength = 12. BLAST protein searches are performed with the BLASTX program, score = 50 word- 
length = 3. When gaps exist between two sequences, Gapped BLAST is utilized as described in Altschul et al (Nucleic 
Adds Res.25:3389-3402,1997). When utilizing BLAST and Gapped BLAST programs, the default parameters of the 
respective programs (e.g., BLASTX and BLASTN) are used. See http://www.ncbi.nlm nih gov 
[0120] The present invention also includes a partial peptide of the proteins of the invention. The partial peptide 
comprises a protein generated as a result that a signal peptide has been removed from a secretory protein If the 
protein of he present invention has an activity as a receptor or a ligand, the partial peptide may function as a compete 
inhibitor of the protein and may bind to the receptor (or ligand). In addition, the present invention comprises an antigen 
peptide for raising antibodies. Forthe peptides to be specific for the protein of the invention, the peptides comprise at 
least 7 amino acids, preferably 8 amino acids or more, more preferably 9 amino acids or more, and even more preferably 
10 amino acids or more. The peptide can be used for preparing antibodies against the protein of the invention or 
competitive inhibitors of them, and also screening for a receptor that binds to the protein of the invention. The partial 
peptides of the invention can be produced, for example, by genetic engineering methods, known methods for synthe- 
sizing peptides, or digesting the protein of the invention with an appropriate peptidase 

[01 21] The present invention also relates to a vector into which the DNA of the invention is inserted The vector of 
the invention is not limited as long as it contains the inserted DNA stably. For example, if E. coli is used as a host 
vec tors such as pBluescript vector (Stratagene) are preferable as a cloning vector. To produce the protein of the in- 
vention expression vectors are especially useful. Any expression vector can be used as far as rt is capable of expressing 
the protein in v.tro, in E. coli, in cultured cells, or in vivo. For example, pBEST vector (Promega) is preferable for in 
vitro expression, pET vector (Invitrogen) for E. coli, pME18S-FL3 vector (GenBank Accession No. AB009864) for cul- 
tured cells and pME18S vector (Mol. Cell. Biol. (1988) 8: 466-472) for in vivo expression. To insert the DNA of the 
invention, ligation utilizing restriction sites can be performed according to the standard method (Current Protocols in 
Molecular Biology (1987) Ausubel et al. edit, John Wily & Sons, Section 11.4-11 11) 

[0122] The present invention also relates to a transformant carrying the vector of the invention. Any cell can be used 
as a host into which the vector of the invention is inserted, and various kinds of host cells can be used depending on 
«! o^T 5 , 368 F ° r Str °" 9 ex P ression of the P rotein in eukaryotic cells, COS cells or CHO cells can be used for example 
[0123] Introduction of the vector into host cells can be performed, for example, by calcium phosphate precipitation 
method, electroporation method (Current Protocols in Molecular Biology (1987) Ausubel et al. edit, John Wily & Sons 
Section 9.1-9.9), lipofectamine method (GIBCO-BRL), or microinjection method etc 
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[0124] The primer of the present invention can be used for synthesizing full-length cDNA, and also for the detection 
and/or diagnosis of the abnormality of the protein of the invention encoded by the full-length cDNA. For example, by 
utilizing polymerase chain reaction (genomic DNA-PCR, or RT-PCR) using the primer of the invention, DNA encoding 
the protein of the invention can be amplified. It is also possible to obtain the regulatory region of expression in the 5'- 

5 upstream by using PCR or hybridization since the transcription start site within the genomic sequence can be easily 
specified based on the 5'-end sequence of the full-length cDNA. The obtained genomic region can be used for detection 
and/or diagnosis of the abnormality of the sequence by RFLP analysis, SSCP, or direct sequencing. 
[0125] Furthermore, the "polynucleotide having a length of at least 15 nucleotides, comprising a nucleotide sequence 
that is complementary to a polynucleotide comprising the nucleotide sequence set forth in any one of SEQ ID NOs in 

w Table 370, or its complementary strand" includes an antisense polynucleotide for suppressing the expression of the 
protein of the invention. To exert the antisense effect, the antisense polynucleotide has a length of at least 15 bp or 
more, for example, 50 bp or more, preferably 1 00 bp or more, and more preferably 500 bp or more, and has a length 
of usually 3000 bp or less and preferably 2000 bp or less. The antisense DNA can be used in the gene therapy of the 
diseases that are caused by the abnormality of the protein of the invention (abnormal function or abnormal expression). 

15 Said antisense DNA can be prepared, for example, by the phosphorothioate method ("Physicochemical properties of 
phosphorothioate oligodeoxynucleotides." Stein (1988) Nucleic Acids Res. 16: 3209-3221) based on the nucleotide 
sequence of the DNA encoding the protein (for example, the DNA set forth in any one of SEQ ID NOs in Table 370). 
[0126] The polynucleotide or antisense DNA of the present invention can be used in gene therapy, for example, by 
administrating it into a patient by the in vivo or ex vivo method with virus vectors such as retrovirus vectors, adenovirus 

20 vectors, and adeno-associated virus vectors, or non-virus vectors such as liposome. 

[01 27] The present invention also relates to antibodies that bind to the protein of the invention . There are no limitations 
in the form of the antibodies of the invention. They include polyclonal antibodies, monoclonal antibodies, or their portions 
that can bind to the protein of the invention. They also include antibodies of all classes. Furthermore, special antibodies 
such as humanized antibodies are also included. 

25 [0128] The polyclonal antibody of the invention can be obtained according to the standard method by synthesizing 
an oligopeptide corresponding to the amino acid sequence and immunizing rabbits with the peptide (Current Protocols 
in Molecular Biology (1987) Ausubel et al. edit, John Wily & Sons, Section 11 .12-11 .13). The monoclonal antibody of 
the invention can be obtained according to the standard method by purifying the protein expressed in E. coli, immunizing 
mice with the protein, and producing a hybridoma cell by fusing the spleen cells and myeloma cells (Current Protocols 

30 in Molecular Biology (1987) Ausubel et al. edit, John Wily & Sons, Section 11.4-11.11). 

[0129] The antibody binding to the protein of the present invention can be used for purification of the protein of the 
invention and also for detection and/or diagnosis of the abnormalities of the expression and structure of the protein. 
Specifically, proteins can be extracted, for example, from tissues, blood, or cells, and the protein of the invention is 
detected by Western blotting, immunoprecipitation, or ELISA : etc. for the above purpose. 

35 [0130] Furthermore, the antibody binding to the protein of the present invention can be utilized for treating the dis- 
eases that associates with the protein of the invention. If the antibodies are used for treating patients, human antibodies 
or humanized antibodies are preferable in terms of their low antigenicity. The human antibodies can be prepared by 
immunizing a mouse whose immune system is replaced with that of human (" Functional transplant of megabase human 
immunoglobulin loci recapitulates human antibody response in mice" Mendez M.J. et al. (1997) Nat. Genet. 15: 

40 1 46-156). The humanized antibodies can be prepared by recombination of the hypervariable region of a monoclonal 
antibody (Methods in Enzymology (1991) 203: 99-121). 

[0131] The present invention further relates to databases comprising at least a sequence of polynucleotide and/or 
protein, or a medium recorded in such databases, selected from the sequence data of the nucleotide and/or the amino 
acids indicated in Table 370. The term "database" means a set of accumulated information as machine-searchable 

^5 and readable information of nucleotide sequence. The databases of the present invention comprise at least one of the 
novel nucleotide sequences of polynucleotide provided by the present invention. The databases of the present invention 
can consist of only the sequence data of the polynucleotide provided by the present invention or can comprise other 
information on nucleotide sequences of known full-length cDNAs or ESTs. The databases of the present invention can 
be comprised of not only the information on the nucleotide sequences but also the information on the gene functions 

50 revealed by the present invention. Additional information such as names of DNA clones carrying the full-length cDNAs 
can be recorded or linked together with the sequence data in the databases. 

[0132] The database of the present invention is useful for gaining complete gene sequence information from partial 
sequence information of a gene of interest. The database of the present invention comprises nucleotide sequence 
information of full-length cDNAs. Consequently, by comparing the information in this database with the nucleotide 
55 sequence of a partial gene fragment yielded by differential display method or subtraction method, the information on 
the full-length nucleotide sequence of interest can be gained from the sequence of the partial fragment as a starting clue. 
[0133] The sequence information of the full-length cDNAs constituting the database of the present invention contains 
not only the information on the complete sequences but also extra information on expression frequency of the genes 
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S3isr, to t rr and known pro,eins - Thus ,he e * ra informati ° n ■«■— ^ 

of 2 orient ElJ^ESSTS ^ ^ inf0,niati0n °" hUman geneS is accu ™'^ in the database 
f^o^T^I! T' ! k / he d3tabaSe ,S USefUl for iso,atin 9 a huma " homologue of a gene originating 
orig^ spedeT ^ h ° m0l ° 9Ue ^ *° lated ta " d °" ,he nUC,eo,ide s ^ a "<» - the gene SZ he 



as an imtial clue If the > full-length of the gene .s not available, the amino acid sequence of the protein encoded bv the 

-2, ' , ' ,SMCir,!a '" , "* h «"™'<°^*°>^™*™°>»c™<™> 



EXAMPLE 1 



Construction of a cDIMA library by the oligo-capping method. 



(1) NT-2 cells were cultured without induction by retinoic acid treatment ((NT2RMf NT2RM2 NT2RM4M 

2 After cu .red, NT-2 cells were induced by adding retinoic acid, and then were cu ItuS for 4 Ss (NT 2 RPn 

sssr were induced by addins retinoic acid - and then were cu,tured f ° r 2 *s 

SK-N^MC ^^J^SSSSS^ ^ ' ine H4 (ATC ° HTG ' 148) (BNGH41) ' human "^roblastoma cel. line 

st^-csaar spring Ha * or Laborat °^ press> - Furth ~ p^^^ssi 

roTS! ' P0,V(A)+RNA ^ PUrmed fr0m the "™ ^ 9 ^!2ose ^ 

Suga 9 o s ^CilTse ^ u ""I" o a CDNA " bra,y bV the 0,i90 Ca ^ in 9 method M. and 

ID NO 25a??h»S. : I n9 ,he ° ,,90 " Cap linker (SEQ 10 N0: 2541 ) the Oligo-dT pruner (SEQ 

analyzed with a DNA seauencer (AB\ pri^m ^77 dc d M , * °' ine 0 ana 3 " ends of the cDNA clones was 
tho hma sequencer (A«l PRISM 377, PE Biosystems after sequencing reactions were DerformPri with 

the DNA sequencmg reagents (Dye Terminator Cycle Sequencing FS Ready ReacL Ki dRhnrmino T ♦ 
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of the cloning site, and SV40 polyA added signal sequence site in the downstream. As the cloning site of pME1 8SFL3 
has asymmetrical Dralll sites, and the ends of cDNA fragments contain Sfil sites complementary to the Dralll sites, 
the cloned cDNA fragments can be inserted into the downstream of the SRa promoter unidirectionally. Therefore, 
clones containing full-length cDNA can be expressed transiently by introducing the obtained plasmid directly into COS 
5 cells. Thus, the clones can be analyzed very easily in terms of the proteins that are the gene products of the clones, 
or in terms of the biological 

[0141] Herein, the cDNA libraries and the name of each clone are related as shown in Table 2. Therein, "xxxxxx" 
represents the clone number of six digits. Thus, the sequences are named by the library name, the clone number plus 
F- for the 5'-end s or R- for the 3'-end. 

10 

Table 2 

15 



20 



25 



30 



35 



40 



45 



50 
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library: 

clone 5'-end sequ ence 3'-end sequence 



NT2RM1: 

NT2RM1 xxxxxx F-NT2RM I xxxxxx 
NT2RP1: 

NT2RP1 xxxxxx F-NT2R?lxxxxxx 
NT2RM2: 

NT2RM2xxxxxx F-NT2RM2xxxxxx R-NT2RM2xxxxxx 
NT2RM4: 

NT2RM4XXXXXX F-NT2RM4xxxxxx R-NT2RM4xxxxxx 
NT2RP2: 

NT2RP2xxxxxx F-NT7RP2xxxxxx R-NT2RP2xxxxxx 
NT2RP3: 

NT2RP3xxxxxx F-NT2RP3xxxxxx R-NT2RP3 xxxxxx 
NT2RP4: 

NT2RP4 xxxxxx F-NT2RP4xxxxxx R-NT2RP4xxxxxx 
BNGH41: 

BNGH4 1 xxxxxx F-BNGH4 1 xxxxxx R-BNGH41 xxxxxx 
SKNMC1; 

SKNMC 1 xxxxxx F-SKNMC 1 xxxx xx R-SKNMCl xxxxxx 
Y79AA1: 

Y79AA 1 xxxxxx F- Y79 A A 1 xxxxxx R-Y79AA 1 xxxxxx 
PLACE1: 

PLACElxxxxxx F-PLACElxxxxxx R-PLACE1 xxxxxx 
PLACE2: 

PLACE2 xxxxxx F-PLACE2xxxxxx R-PLACE2xxxxxx 
PLACE3: 

PLACE3xxxxxx F-PLACE3xxxxxx R-PLACE3xxxxxx 
OVARC1: 

OVARClxxxxxx F-OVARClxxxxxx R-OVARC1 xxxxxx 
HEMBA1: 

HEMBAlxxxxxx F-HEMBA1 xxxxxx R-HEMBAIxxxxxx 
HEMBB1: 

HEMBBlxxxxxx F-HEMBB I xxxxxx R-HEMBB1 xxxxxx 
MAMMA 1: 

MAMMA J xxxxxx F-MAMMA 1 xxxxxx R- MAMMA ! xxxxxx 
THYROI: 
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F-THYROlxxxxxx R-THYROlxxxxxx 



5 

EXAMPLE 2 

Estimation of the fullness ratio of the 5'-ends of the clones contained in the cDNA libraries constructed by the oligo- 
capping method. 

10 

[0142] The fullness ratio at the 5'-end sequences of the 59,823 clones in the human cDNA libraries constructed by 
the oligo-capping method was determined as follows. Of all the clones whose 5'-end sequences were found in those 
of known human mRNA in the public database, a clone was judged to be "full-length", if it had a longer 5'-end sequence 
than that of the known human mRNA, or, even though the 5*-end sequence was shorter, if it contained the translation 
*5 initiation codon. A clone which did not contain the translation initiation codon was judged to be "non-full-length". The 
fullness ratio ((the number of full-length clones)/(the number of full-length and non-full-length clones)) at the 5'-end of 
the cDNA clones from each library was determined by comparing with the known human mRNA. As a result, the fullness 
ratio of the 5'-ends was 63.5%. It suggests that the human cDNA clones obtained by the described method have 
complete 5'-ends with high probability. 

20 

EXAMPLE 3 

Assessment of the fullness ratio of the 5'-end of the cDNA by the ATGpr and the ESTiMateFL. 

25 [0143] The ATGpr, developed by Salamov A. A., Nishikawa T, and Swindells M.B. in the Helix Research Institute, is 
a program for prediction of the translation start codon based on the characteristics of the sequences in the vicinity of 
the ATG codon. The results are shown with expectations that an ATG is a true start codon (0.05-0.94). When this 
program is applied to general cDNAs without considering whether or not the ATG codons in the cDNAs are the true 
initiation codons of the cDNAs, both the sensitivity and the specificity of the results are estimated at 66%. Here, the 

30 sensitivity means the ratio of the number of codons judged to be initiation codons by the program to the total number 
of true initiation codons, and the specificity means the ratio of the number of true initiation codons to the number of 
codons judged to be initiation codons by the program. In contrast, when the program was applied to the 5' -end se- 
quences of the clones from the cDNA library that was obtained by the oligo-capping method and that had 65% fullness 
ratio, the sensitivity and specificity of evaluation of a full-length clone (clone containing the N-terminal end of ORF) 

35 were improved to 82-83% by selecting only clones having the ATGprl score 0.6 or higher. 

[0144] Furthermore, the program was used to assess the fullness of 18,959 clones in the human cDNA libraries 
obtained here, which have 5'-ends matched to a known human mRNA. Briefly, the maximal ATGprl score of the clones 
was determined, and then their 5' -end sequence was compared with the known human mRNA to estimate whether 
the clone is full-length or not. The result was summarized in Table 3. Based on the knowledge that known mRNAs, in 

40 general, are highly expressed in the cell, the expression levels of genes having a low number in the EST hit, which 
represent mRNAs whose expression levels are relatively low were examined, and the result is shown in Table 4. 
[0145] In the table, the number of full-length clones indicate that of clones containing the N-terminal end of ORF, 
and so does the number of non-full-length clones that of clones without the N-terminal end of ORF. The fullness ratio 
represents (the number of full-length clones)/(the number of full-length clones plus the number of non-full-length 

45 clones). 



Table 3 



The maximal ATGprl score and the fullness ratio of the 5'-end sequences of clones obtained from human cDNA 
libraries constructed by the oligo-capping method; clones having a matched 5'-end with that of a known human 
mRNA. 


maximal ATGprl score 


number of (full-length 
clones plus non-full-length 
clones) 


number of full-length 
clones 


fullness ratio 


>=0.70 


11,193 


9,346 


83.5% 


>=0.50 


13,369 


10,549 


78.9% 


>=0.30 


15,489 


11,340 


73.2% 
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Table 3 (continued) 



The maximal ATGprl score 
libraries constructed by the 
mRNA. 


i and the fullness ratio of the 5'-end sequences of clones obtained from human cDNA 
oligo-capping method; clones having a matched 5'-end with that of a known human 


maximal ATGprl score 


number of (full-length 
clones plus non-full-length 
clones) 


number of full-length 
clones 


fullness ratio 


>=0.15 
>=0.00 


17^94 
18,959 


11,811 

12,046 


6^9% 
63.5% 



Table 4 



The maximal ATGprl score 
libraries constructed by the 
matched 5*-end with that o 


' and the illness ratio of the 5'-end sequences of the clones obtained from human cDNA 
* oli 9o-capping method; clones having 5 EST hits or less among the clones having a 
f a known human mRNA. 


maximal ATGprl score 


number of (full-length 
clones plus non-full- 
length clones) 


number of full-length 
clones 


fullness ratio 


>=0.70 
>=0.50 
>=0.30 
>=0.15 
>=0.00 


2,801 
3,683 
4,683 
5,559 
6,113 


1,934 
2,393 
2,707 
2,890 
3,013 


69.0% 
65.0% 
57.8% 
52.0% 
49.8% 



[0146] The ESTiMateFL, developed by Nishikawa and Ota in the Helix Research Institute, is a method for the se- 
dmabase * ^ ^ ^ ^ COmparing with the 5 '- end or 3 '- end sequences of ESTs in the public 

[0147] By the method, a cDNA clone is judged presumably not to be full-length if there exist any ESTs which have 

XlriTJ °\ n^^TT th3n ,he C ' 0ne - The meth0d is s V stematized ^ high throughput analysis. A done 
is judged to be full-length ,f the clone has a longer 5'-end sequence than ESTs in the public database. Even if a clone 

tZ a J°T, ' I "T d M ,he f? 6 iS jUd9ed ,0 bS ,U "'' en9th " the difference in len 9 th is wi,hin 50 ba ses, and otherwise 
£ t0 t ^ 0 U ""'?: 9th ' ,0r convenience - case of the 5'-end sequence of the clones which matches a known 
mRNA about 80 /„ of the sequences that were judged to be full-length by comparing with ESTs was judged to be full- 
length by est.mat.ng the 5'-end sequence, as well; about 80% of the sequences that were judged to be not full-lenqth 
by comparing with ESTs was judged to be not full-length by estimating the 5'-end sequence, as well. The accuracy of 
the prediction by comparing cDNA clones with ESTs is improved with increasing number of ESTs to be compared 
However, when only a limited number of ESTs are available, the reliability becomes low. Thus, the method is effective 
in excluding clones wrth high probability of being non-full-length, from the cDNA clones that is synthesized by the oliqo- 
cappmg method and that have the 5"-end sequences with about 60 % fullness ratio. In particular, the ESTiMateFL is 
eftaently used to estimate the fullness ratio at the 3'-end sequence of cDNA of a human unknown mRNA which has 
a significant number of ESTs in the public database. 

[0148] The 1 8,959 clones isolated from human cDNA libraries constructed by the oligo^apping method, which have 
he 5 -end sequence that matches a known human mRNA, were estimated by using the ATGpr and ESTiMateFL Briefly 
the 5 -end sequence that matches a known human mRNA of the respective clone was ana^zed to obtain the maximal 
ATGpr score, and compared with the ORF of the known human mRNA that matches it to determine whether the done 
is full-length or not. Then, the 5"-end sequence of the respective clone was analyzed by the ESTiMateFL to judge 
whether he clone .s full-length or not. Specially, the 5'-end sequences that match a known human mRNA of the 
18,959 clones constructed by the oligo-capping method were compared with those of ESTs by the ESTiMateFL and 
the clones other than those that are not full-length were selected. Then, the selected clones were used to analyze the 
relationship between the ATGpr and the fullness ratio. The result was summarized in Table 5. Also, among the selected 
theclone S ,nwh,chthenumberoftheE^ 

in Table 6, which represents the result of the analysis of mRNA with relatively low abundance 

[0149] In the Tables, the number of full-length clones, the number of non-full-length clones, and the fullness ratio 

indicate the number of the clones that contain the N-terminus of the ORF, the number of the clones that do not contain 
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the N-terminus of the ORF, and (the number of full-length clones)/(the number of full-length clones) plus (the number 
of non-full-length clones), respectively. 



Table 5 



The maximal ATGprl score and the fullness ratio of the 5'-end sequence in the clones isolated from human cDNA 


libraries constructed by the oligo-capping method, which have the 5'-end sequence that matches a known human 


mRNA, and also other than those being not full-length according to the comparison with ESTs. 


maximal ATGprl score 


number of (full-length 


number of full-length 


fullness ratio 




clones plus non-full- 


clones 






length clones) 






>=0.70 


9,068 


8,349 


92.1% 


>=0.50 


10,345 


9,318 


90.1% 


>=0.30 


1 1 ,425 


9,964 


87.2% 


>=0.15 


12,254 


1 0,335 


84.3% 


>=0.00 


12,785 


1 0,484 


82.0% 



Table 6 



The maximal ATGprl score and the fullness ratio of the 5'-end sequence in the clones isolated from human cDNA 


libraries constructed by the oligo-capping method, which have the 5'-end sequence that matches a known human 


mRNA, and also other than those being not full-length according to the comparison with ESTs, in which the number 


of the EST hit is not more than 5. 






maximal ATGprl score 


number of (full-length 


number of full-length 


fullness ratio 




clones plus non-full- 


clones 






length clones) 






>=0.70 


1,959 


1,510 


77.1% 


>=0.50 


2,469 


1,821 


73.8% 


>=0.30 


2,975 


2,046 


68.8% 


>=0.15 


3,368 


2,164 


64.3% 


>=0.00 


3,661 


2,226 


60.8% 



[0150] According to the above results, it was found that, in case of using clones isolated from human cDNA libraries 
constructed by the oligo-capping method, the fullness ratio of the clones that have low score in the ATGpr can be 
improved by assessing their 5*-end sequence using the combination of the ATGpr and the ESTiMateFL Therefore, the 
method was applied to select a cDNA clone with high fullness ratio. 



EXAMPLE 4 

Clustering of the 5'-end and 3'-end sequences of cDNA clones. 

[0151] The 5'-end and 3'-end sequences of cDNA clones were obtained, and clustered separately. Briefly, data of 
the single pass sequencing of the determined 5'-end and 3'-end of cDNA clones was subjected to the BLAST search 
between the sequence data of all the clones synthesized in Example 1 , and clones that are supposed to be originating 
from the same gene were clustered into a group. For the 5' -end sequence, those having the consensus sequence of 
95% identity 300 base pairs or more are clustered into the same group. For the 3'-end sequence, those having the 
consensus sequence of 90% identity 200 base pairs or more are clustered into the same group. Among the clusters 
of the 5'-end and 3'-end sequences, the sequence having the longest lead was chosen as the representative sequence 
of the cluster (group). 

EXAMPLE 5 

Characterization of the representative sequences and the sequences of clones 

[01 52] Data of the 5*-end sequences of the representative sequences and clones was characterized by the following 
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methods: 

(1) judging whether It is identical to the sequence of mRNA or ESTs from human by the BLAST search of the 
GenBank or SwissProt, and examining whether it is full-length by comparing with the sequences of known mRNA 
and ESTs from human. 

(2) determining the ATGprl score using all the initiation codons contained within the 5'-end sequence by the ATGpr 
which predict fullness ratio. 

(3) predicting the existence of the signal sequence using all the initiation codons contained within the 5'-end se- 
quence by the PSORT which predict signal. 

and, 

(4) only with the 5'-end sequences of the representative sequences of the clusters, examining the keywords in the 
top hit data of the homology search of the SwissProt. 

[01 53] Data of the characterized representative sequences and clones was used for the final selection of the clones. 
EXAMPLE 6 

Identity to the human mRNA and human EST, and comparison of the 5'-end length. 

[0154] The clones and the representative sequences of the clusters were judged to be identical to any human mRNA 
if their 5'-end sequence has a region of 200 nucleotides or longer with 94% or more identity to the mRNA The clones 
and the representative sequences of the clusters were judged to be identical to any human EST if their 5'-end sequence 
has a region of 200 nucleotides or longer with 90% or more identity to the EST 

[0155] The clones and the representative sequences of the clusters were judged to be full-length in comparison with 
human mRNA, if their 5'-end sequence is longer than those of the mRNA, or it contains the translation initiation site 
The clones and the representative sequences of the clusters were judged to be full-length in comparison with human 
EST in the database, if their 5'-end sequence is longer than those of the EST, or even though it is shorter the difference 
in length between the two sequences is 50 nucleotides or less, for convenience. Otherwise, the clones and the repre- 
sentative sequences of the clusters were judged to be not full-length. 

EXAMPLE 7 

Prediction of the fullness ratio by the ATGpr. 

[01 56] The score in the ATGprl is the expectation to be full-length based on calculations, and the higher score reflects 
the higher fullness ratio as shown in Example 3. Further, the maximal ATGprl score represents the score obtained 
with all the initiation codons contained in the 5'-end sequence of the clones and the representative sequences and 
are used for the characterization. 

EXAMPLE 8 

Prediction of the existence of a signal sequence by the PSORT. 

[0157] Prediction of the existence of a signal sequence by the PSORT was performed on all of the amino acid se- 
quences predicted from all the initiation codons in the 5'-end sequence of the clones and the representative sequences 
of ihe clusters. By analyzing the presence or absence of the sequence which is predicted to be a signal sequence 
which is characteristics of the N-terminus of many secretory proteins, cDNA clones encoding a secretory protein or 
membrane protein were selected. 

EXAMPLE 9 

Prediction of the protein function by the BLAST search. 

[01 58] The 5'-end sequence of the representative sequences of the cluster was analyzed by the BLAST homology 
search of the SwissProt. The obtained top hit data was classified into those identical to the 5'-end representative 
sequence (identity was 90% or higher), those not identical to the 5"-end representative sequence (identity was 60% or 
lower, and compared sequence was not more than 25 nucleotides), and those similar to the S-end representative 
sequence (the rest of the data). 
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[0159] All the keywords in the SwissProt data corresponding to the top hit data were selected, and the 5'-end rep- 
resentative sequences were classified by the keywords relating with functions. 
The keywords relating with a secretory protein or membrane protein are the followings: 

5 growth factor, 

cytokine, 
hormone, 
receptor, 

G-protein coupled receptor, 
10 ionic channel, 

voltage-gated channel, 

calcium channel, 

extracellular matrix, 

transmembrane, and 
*5 signal. 

[0160] The keywords relating to glycoprotein is glycoprotein. 

[0161] The keywords relating to signal transduction are the followings: 

20 serine/threonine-protein kinase, 

tyrosine-protein kinase, and 
calmodulin-binding. 

[0162] The keywords relating to transcription are the followings: 

25 

transcription regulation and activator, 
transcription regulation and repressor, and 
nuclear protein and repressor. 

30 [0163] The keywords relating to diseases are disease mutation, and syndrome. 

[0164] Many keywords overlapped in the respective group (receptor and transmembrane, for example), and some 
keywords overlapped in different groups (secretory or membrane, and diseases, etc.). 

EXAMPLE 10 

35 

Selection of clones by characterization. 

[0165] From the data obtained by the above characterization, clones encoding a novel secretory protein or membrane 
protein, or proteins with other predicted functions were selected by the combination of the ATGprl score and the 

40 prediction of the signal sequence by the PSORT, or according to the top hit data in the homology search of the SwissProt. 
[0166] In selecting the clones, the 5'-end sequences that are identical to a human mRNA were ignored, whereas 
those that are identical to a human mRNA in part but obviously not identical in the other part were included. Because 
there were clones selected that are identical to a human mRNA in part but obviously not identical in the other part. 
[0167] Also, if the finally selected clones were found to be not full-length compared with the sequences of human 

45 mRNA and ESTs, these clones were discarded. 

EXAMPLE 11 

A method for selection of clones by the combination of the ATGprl score and the prediction of the signal sequence by 
50 the PSORT (a method for selection of secretory proteins and membrane proteins that are novel and full-length). 

[01 68] The sequences of clones and the representative sequences of their clusters were used to obtain the maximal 
ATGprl score and predict the presence of the signal sequence. First, clones were selected based on the representative 
sequences of the clusters. The correspondence between the name and SEQ ID of the representative sequences used 
55 for selection (Table 368), and the correspondence between the name and SEQ ID of the introns (including the repre- 
sentative sequences of the 5'-end and 3'-end, and ESTs) used for selection of clones from the representative sequences 
of the groups (Table 369) were shown in the last part of the present specification. Therein, HRIFA and HRIRA indicate 
the representative sequence of the 5'-end group, and that of the 3' -end group, respectively. 
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£1? ?! C i" Ste ^ Which a single clone is comained ( ,he se ^ence of the 5'-end clone = the representative 
and F<?Tfe 1 ,T 1 S6leC ' ed W6re the C ' 0neS ,h3t Were judged 10 be fulMength in comparison with human mRNA 
h! " 1 ? ? 6 maX ' mal ATGPM SC ° re 0 5 ° r hi9her ' and ? redicted »° ^in the signal sequence in principle 
,h«n oT'th Wm9 , C f eS ' 8 C ' 0ne haVin9 3 ,0n9er 5 '- end Was se,ec,ed: ,he ATGprl score was .ess 

SassLd nf n r UenCe ? ^ 5 '" end W3S n0t fU,Wen9th - ,he C,0ne was obvious, y shorter though the clone was no 
inc^o tt!e <v °» ^ f th 16 ^h* ^ acc o't'ing to the BLAST search of the other clones, or the 5'-end sequence correspond- 
ed til I k ° ther K C ° nes in the same cluster in ""idi the 3--end sequence of the done was contained was 
found to be longer by assembhng. Furthermore, if there were multiple full-length clones in the same cluster and I wa! 

tt sz^ss f ^ w r has the ,on9er 5, - end - a " ,h? ci ° nes — sej::z;^:i 

he Sequenchert ) (Hrtach. Soft Eng.neenng ) was used. As a result, the signal sequence predicted to be present in 
the representee sequence was not present in some of the selected clones. In some cases, the ATGprl score became 

5122? A? f "" neSS ra,i ° " ,h6Se C,0n6S WaS l0W " yet S,i " « is P ossible *■ ^e clones SSSST 
The clones ,n which the signal sequence predicted to be present in the representative sequence was not presenter 
seect,on were likely -to be without the signal sequence, bu, still Ml.po^ihmttactone.^arZLZSiS 
0170 In the clusters comprising multip le clones, in which the representative sequence of the 5"-end was prSed 

wer SSST' S6,eCt ? ^ C '° neS haVm9 l0n9eSt 5 "- d -quence among the donefJh! 
were judged to be full-length compared with human mRNA and ESTs, having the maximal ATGprl score for the 5'-end 
sequence 0.5 or h.gher, and predicted to contain the signal sequence. However, in the following cases a done having 

InTZT ^ ^ maXima ' ATGpr1 SC ° re WaS ,6SS ,ha " °- 5 ' *° »f 'he 5--end was not f ! 

bSLt II JX r °h 7* a ' th0U9h ,he C '° ne WaS n0t c,assified imo the sa ™ duster according to the 

d^ter in Si Z v « " "? corres P onding to the *+>« of the other clones in the same 

mdt e J te 1 1'' nd Se , q h Uence 0f ,he clone was stained was found to be .onger. Furthermore, if there were 
mult,ple full-length clones ,n the same cluster and it was not successful to determine by assembling which has the 

sequence was not present in some of the selected clones. In some cases, the ATGprl score became smaller tha^ 5 

whf,h , h r S ° ln th6Se C ' 0neS W3S ' 0W ' y6t Sti " » iS P° Ssible that « he c, °n« are full-length The clones in 
2L . k 9 T Seq , UenCe PrediCted 10 bS Present in the ^P^sentative sequence was not present atter sekction we e 

01 71 0 nZTZ e : 9 T sequence at the 5 '- end ' but s,iH * is possibie mat the ci ° nes e -° de a -embranTp;:;; 6 

Sid JL Tt COmpnSln9 mU ' ,iple C ' 0neS ' in Whk:h ,he representative sequence of the 5'-end was 

rex" SeqUenCe ' Se ' eCted W6re ,he C, ° neS Which were i udged 10 be full-length compared wfth 
c™ ™u T :nce aVm9 ^ ATGPM SC ° re 5 '" end " «* ^ a " d - 

[0172] The number of the clones selected by the combination of the ATGprl score and the prediction of a sional 

Zicted to" : PSORT T 6 254 - ™ e °' ,he d ° neS Having the maximal ATG ^ score 0 5 or hte S 

Zt^Z 8 S ' 9na f qUenCe W6re 1 70 (Tab ' e 7 " 1 0) - Among ,he c,ones ' 1 64 cl °"cs were found to have tne 

representative sequence of the original cluster that fulfills the same conditions. On the other hand 5 clones were 

selected from the representative sequences of the 5--end of the clusters which was predicted cental a TnaHe 
quenoc | whrte the maxima. ATGprt score was lower than 0.5. A clone was selected from the representee sequence 
of the 5 -end of the cluster which was predicted to have no signal sequence sequence 
[01 73] The clones that have the maximal ATGprl score 0.3 or higher and less than 0.5 and predicted to contain the 
signal sequence were 35 clones (Table 11 ), in which 8 clones were found to have the representative7equ^ct Z 2 
ong,naclusterthatfulfillsthesameconditions. Twenty-seven clones were selected from 

r01741 ^pTi thTh r ima ' ATGf>r1 SC ° re 0 5 ° r higher and were P redicted to have no signal sequence 
were 41 clones (Table 1 2). The clones that have the maximal ATGprl score 0.5 or higher and were predicted to have 
no s,gnal sequence were 4 clones (Table 1 3). The clones that have the maximal ATGprl score 0 3 or higher and ,ess 
than 0.5 and were predicted to have no signal sequence were 2 clones (Table 14). The clones that have the m^mal 
ATGprl score .ess than 0.3 and were predicted to contain a signal sequence were 2 Cones (Table 15 ^ T^e represent 

ZZ£tt™ittttt maximal ATGpr1 score 05 or hi9her> 03 or hi9her ' and 0 or hi9her 
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The 170 clones in which the selected clones have the maximal ATGprt score 0.5 or higjer, 
and were predicted to contain a signal sequence by the PSORT 



name of clone iname of sequence 


IlldXIItldl ! 


signal 


name of representative [maximal 


signal 






^ i opr i ocof e 




sequence i 


VTGpri score 




HEMBA1000713 


F-HEMBA1000713 


U.D / 


Yes 


ft *Mf 1* A w M AH M 

HRIFA01 7729a 


0.57 


Yes | 


HE MB A 1000962 


F-HEMBA1 00096* 


ft en 


Yes 


HRIFA000899a 


0.69 


Yes 


HEMBA1 001272 


F-HEMBA 100 1272 


U.y4 


Yes 


HRIFA001 179a 


0.94 


Yes 


HEMBA100I297 


F-HEMBA10D1297 


ft OQ 


Yes 


HRIFA001201a 


0.89 


Yes 


HEMBA1 002420 


F-HEMBA1 002420 


n a 

0.6 


Yes 


HRIFA002195a 


0.6 


Yes 


HEMBA1003101 


F-HEMBA100310I 


0.67 


Yes 


HRIFA002787a 


0.94 


Yes 


HEMBA1 003399 


F-HEMBA 1003399 


u.94 


Yes 


HRIFA002985a 


0.94 


Yes 


HEMB A 1003732 


F-HEMB A 100373;? 


0.86 


Yes 


HRIFA003169a 


086 


Yes 


HEMBA1004110 


F-HEMBA1004110 


0.59 


Yes 


HRIFA003379a 


0.59 


Yes 


HEMBA1 005430 


F-HEMBA 1005430 


0.69 


Yes 


HRIFA020661a 


0.69 


Yes 


HEMBA 100601 6 


F-HEMBA1006016 


0.6 


Yes 


HRIFA020466a 


0.6 


Yes i 


HEMBA1G06171 IF-HEMBA10O617I 


0.62 


Yes 


HRIFA021399a 


0.62 


Yes 


HEMBA1006311 F-HEMBA 1006311 


0.94 


Yes 


HRIFA021594a 


0.94 


Yes 


HEMBA 1006335 ! F-HEMB A1 006335 


0.83 


Yes 


HR1FA01 2069a 


0.94 


Yes 


HEMBA1 006357 1 


F-HEMB A1 006357 


0.67 


Yes 


HRIFA021448a 


0.67 


Yes 


HEMBA 1006658 1 


F-HEMB A1 006658 


0.66 


Yes 


HRIFA021323a I 


0.66 


Yes 


HEMBA 1006707 | 


F-HEMBA 1006707 


0.66 


Yes 


HRIFA02l499a 


0.94 


Yes 


HEMBA1 006902 I F-HEMBA1 006902 


066 


Yes 


HRIFA021754a 


0.94 


Yes 


HEMBA 1006960 


F-HEMB A1 006960 


0.94 


Yes 


HRIFA021886a 


094 


Yes 


HEMBB 1000276 


F-HEMBB 1000276 


0.94 


Yes 


HRIFA029577a 


094 


Yes 


HEMBB 1000642 


F-HEMBB 1000642 


0.94 


Yes 


HRlFA029779a 


0.94 


Yes 


HEMBB 1000905 


F-HEMBB 1000905 


0.94 


Yes 


HRIFA009764a ! 0.91 


Yes 


HEMBB 100 1200 


F-HEMBB 1001 200 


0.83 


Yes 


HRIFA030839a 


0.81 


Yes 


HEMBB 100 1407 


F-HEMBB 1001 407 


0.87 


Yes 


HRIFA030981a 


0.87 


Yes 


HEMBB1001530 


F-HEMBB1001530 


0.6 


Yes 


HRIFA031062a 


06 


Yes 


HEMBB1001547 


F-HEMBB 100 1547 


0.87 


Yes 


HR!FA031075a 


0.87 


Yes 


HEMBB1001978 


F-HEMBB 1001 973 


0.7 


Yes 


HRfFA031350a 


0.7 


Yes 


HEMBB1002162 


F-HEMBB1002162 


0.91 


Yes 


HRIFA031472a 


0.91 


Yes 


HEMBB 1002228 


F-HEMBB 1002223 


0.53 


Yes 


HRlFA031510a 


0.53 


Yes 


HEMBB 1002245 


F-HEMBB1002245 


0.94 


Yes 


HRIFA032984a 


0.94 


Yes 


HEMBB1002427 


F-HEMBB 100242 7 


F 6.57 


Yes 


HRJFA005760a 


094 


Yes 


HEMBB 1002465 


F-HEMBB 100246 5 0.72 


Yes 


HRIFA031672a 


0.72 


Yes 


HEMBB1002693 


F-HEMBB1002693 : 0.64 


Yes 


HRIFA031895a 


0.64 


Yes 


MAMMA 1000046 


F-MAMMA 1000046 1 07 


Yes 


HRIFA024S41a I 07 


Yes 


MAMMA1000102 


F-MAMMA 10001 02 1 0.79 


Yes 


HR]FA026151a 


0.79 


Yes 


MAMMA1000118 


F-MAMMA 1000 118 0.81 


Yes 


HRIFA026153a 


i 0.81 


Yes 


MAMMA1000141 


F-MAMMA1000141 


0.8 


Yes 


HRIFA024554a 


[ 0.8 


Yes 


MAMMA 1000449 


F-MAMMA 1 000449 


0.94 


Yes 


HRIFA026203a 


0.94 


Yes 


MAMMA 1000457 


F-MAMMA1 000457 


078 


Yes 


HRIFA026210a 


0.78 


Yes 


MAMMA1 000652 


F-MAMMA 1000652 


0.94 


Yes 


HRlFA026346a 


0.94 


Yes 


MAMMA1000994 


F-MAMMA1 000994 


0.84 


Yes 


HRIFA026735a 


0.84 


(Yes 


MAMMA1001141 


F-MAMMA 1 001 141 


0.89 


Yes 


HRIFA027265a 


0.89 1 Yes 


MAMMA1001310 


F-MAMMA10013il0 


0.74 


Yes 


HRlFA026899a 


0.74 


Yes 


MAMMA1001344 


F-MAMMA1001344 


0.71 


Yes 


HRIFA0269t8a 


0.71 


Yes 


MAMMA 1002070 


F-MAMMA 1 002070 


0.6 


Yes 


HRIFA028371a 


0.82 


Yes 


MAMMA1 002087 


F-MAMMA1 002087 


0.68 


Yes 


HRIFA027619a 


0.68 


Yes 


MAMMA1002165 


F-MAMMA1002165 


0.57 


Yes 


HRIFA027673a 


0.34 


Yes 


MAMMA1 002205 


F-MAMMA1 002205 


0.74 


Yes 


HRlFA027701a 


0.74 


Yes 


MAMMA1 002633 


F-MAMMA1 002633 


0.53 


! Yes 


HR1FA030461a 


0.94 


► Yes 


NT2RM2000241 


F-NT2RM2000241 


0.94 


Yes 


HRIFA020965a 


0.94 


Yes 


NT2RM2000514 


F-NT2RM2000514 


0.51 


Yes 


HRIFA022106a 


0.51 


Yes 


NT2RM2001643 


F-NT2RM2001643 


0.69 


(Yes 


HRlFA028926a 


0.6S 


i Yes 


NT2RM4000115 


F-NT2RM4000115 


0.56 


i Yes 


HRIFA025792a 


0.53 


1 Yes 


NT2RM4000997 


5F-NT2RM4000997 | 09<J 


\ Yes |HRIFA029274a 


094 


\ Yes 
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Table 8 



The 170 clones in which the selected clones have the maximal ATGprt score 0 5 or hiqher 
and were predicted to contain the signal sequence by the PSORT ' 



name of clone 



name of sequence 



NT2RM4Q0I321 



NT2RM4001325 



NT2RM4001768 



NT2RP1 000446 



F-NT2RM400I3 21 



F-NT2RM40013 25 



F-NT2RM4001768 



maximal 



ATGprt score 



a74 



0.94 



0.73 



NT2RP1001563 



F-NT2RP1 000448 I 062 



NT2RP2001915 



MT2RP200201 5 



NT2RP2002063 



NT2RP2002304 



NT2RP2002674 



F-NT2RP 1001 533 



F-NT2RP2001915 



F-NT2RP2002015 



F-NT2RP200205 3 



0.52 



0.94 



JI.94 
0.87 



Signal name of representative maximal 



sequence 



Yes .HRrFA024533a 



Yes 



Yes 



Yes 



Yes 



Yes 



Yes 



NT2RP200272I 



NT2RP2003383 



NT2RP2003593 



MT2RP2003599 



MT2RP2003655 



NT2RP2004J79 



NT2RP2004495 



NT2RP20 04524 



MT2RP2004556 



NT 2RP2004837 
N T2RP2005027 



F-NT2RP2002304 ! 087 



F-NT2RP20026 74 [ 0.8 



F-NT2RP20027. 21 



F-r*T2RP2 003383 



F-NT2RP2003593 



F-NT2R P2003599 



F-NT2RP20036 55 



F -m*2RP20041 79 
F -NT2RP20044 95 
F-NT2RP20045 24 



F-NT2RP20045 56 



0.56 



0.67 



0.73 



0.58 



Yes 



HRlFA033349a 



HRJFAO 13668a 



HRIFA005356c 



ATGprt score 



0.74 



si gnal 



Yes 



0.94 



0.5 



HRIFA0dSQl8a 



HRlFA00754la 



HRlFA007619a 



Yes 



HRlFA007659a 
lHRJFA007829a~ 



Yes !HRIFA008099 a 



0.62 



0.52 



0.94 



0.94 



0.94 



0.94 



0.8 



Yes 



Yes 



Yes 



Yes 



Yes 



Yes 



Yes 



Yes 



Yes 



Yes_jHRtFA008^^ 



Yes HRIFAO08606a 



Yes 



0.78 



0.83 



0.58 



0.73 



NT2RP2005463 



NT2RP2005514 



F-NT2RP20048 37 



F-NT2RP20050 27 



F-NT2R P2005463 



F-NT2RP20055 14 



NT2RP200588 7 ,F-NT2RP20058a7 



NT2RP2006269 



NT2RP3000169 



NT2RP300Q460 



F-MT2RP20Q62 69 



F-NT2RP3OQ01 69 



NT2RP3000769 



NT2RP3000818 



NT2RP30O1012 



NT2RP3001044 



NT2RP3Q01560 



NT2RP300I685 



NT2RP3001853 



NT2RP3002160 



NT2RP3002836 



F-NT2RP30004 60 



F-NT2RP30007 89 



F-NT2RP3000818 



F-NT2RP30010 12 



F-NT2RP3001C44 



0.81 



0.94 



0.92 



0.93 



0.58 



0.94 



Yes 



Yes 



Yes 



Yes 



Yes 



Yes 



Yes 



Yes 



Yes 



Yes 



Yes 



0.78 



0.94 



0.61 



0.62 



0.52 



0.67 



F-NT2RP30015 60 



0.93 



F-NT2RP3001685 



F-NT2RP3001S 58 



F-NT2RP30021 60 



F = NT2RP3002fi36 



OT2RP3002958 F-NT2RP3002S 5ft 



NT2RP3003535 F-NT2RP3^3f i35 



0.58 



0.5 



0.94 



0.61 



068 



Yes 



Yes 



Yes 



Yes 



Yes 



Yes 



Yes 



HR]FA008252a 



HRfFA008753a 



HRIFA003784a 



HRIFA008827 



HRIFA009372a 



HR1FA009392a 



HRIFA009414a 



HRIFA006216a 



HR!FA004145a 



HRIFA01 0034a 



HRlFA010070a 



HRIFA0l0322a 



HRlFA025913a 



HRIFA022262 



HRIFA022794a 



0.67 



094 



0.58 



0.83 



0.83 



0.58 



0.82 



0.81 



0.93 



0.93 



0.42 



0.58 



0.94 



0.57 



094 



HRlFA023605a 



HRlFA023619a 



HRIFA0231 29a 



Yes 



Yes 



Yes 



Yes 



NT2RP3004000 



NT2RP3004321 



NT2RP3004355 



NT2RP3004374 



NT2RP4001001 



NT2RP4002715 



OVARC1000298 



OVARC1000775 



OVARC1000811 



OVARC100Q853 



OVARC1001222 



OVARC10Q1807 



OVARC1001833 



P LACEl0uQ23r 



F-NT2RP3004000 



F-NT2RP30043 21 



F-NT2RP30043 55 



F-NT2RP3004374 



F-NT2RP40010Q1 



F-NT2RP4O02715 



F-OVARC100Q 298 



F-QVARC100Q77 5 



F-OVARC 10008 11 



F-OVARC10 0C853 



F-OVARC1 001222 



F-OVAROl6Qia07 



PLACE 1000560 



PIACE1 000740 



F-OVARC 1001 833 



0.54 



0.94 



0.93 



0.81 



0.6 



Yes 



Yes 



HR|FA0O7026a~ 



HRIFA030599a 



HRlFA02352ta 



HRlFA026490a 



HRIFA005760a 



0.61 



0.62 



0.52 



0.22 



0.73 



Yes 



Yes 



Yes 



Yes 



Yes 



Yes 



Yes 



Yes 



Yes 



Yes 



No 



Yes 



Yes 



Yes 



Yes 



Yes 



Yes 



Yes 



Yes 



0.92 



0.5 



0.94 



HR|FA024392 a 



Yes 



Yes 



Yes 



0.58 



0.53 



0.94 



JX61 
0J 



0.52 



0.94 



0.79 



0.52 



F~P LACE 1000231 



F-PLACE1 000360 



F-PLACE1 0007 40 



0.9 
0.52 



0.88 



0.57 



Yes 



Yes 



Yes 



Yes 
Yes 



Yes 



Yes 



Yes 



HRIFA0 17670a 



HRlFA025498a 



HR]FA025276a 



HRlFA0257 86a 



HR3F A025360a 



H RiFA024533a 



HRlFA0092l4a 



HRIFA024921a 



HR£A004852a 



HRIFA01 1347a 



HRlFA0O0974 a 



Yes 



HRIFA01 1403a 



Yes 



Ye< 



Yes 



Yes 



Yes 



HRiFA0227l4a 



0.94 



0.72 



0.91 



0.94 



0.93 



0.81 



0.6 



0.74 



0.5 



0.53 



0.59 



Yes 



Yes 



Yes 



Yes 



Yes_ 
Yes 



Yes 



Yes 



Yes 



Yes 



Yes 



Yes 



Yes 



Yes 



0.7 



0.39 



0.94 



HRlFA021069a 



HRIFA021136a 



HRIFA0H802a 



HR1FAQ 12022a 



HRlFA012151a 



0.67 



0.52 



0.9 



0.52 



0.88 



0.57 



Yes 



Yes 



Yes 



Yes 



Yes 



Yes 



Yes 



Yes 



Yes 



Yes 
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Table 9 



The 170 clones in which the selected clones have the maximal ATGprl score 0.5 or higher, 
and were predicted to contain the signal sequence by the PSORT 



name of clone 


lame of sequence r 


maximal 


signal r 


lame of representative r 


naximal : 


signal 






\TGpr1 score 


< 


;equence / 


\TGpr1 score 




PLACE1000912 


F-PLACE 10009 12 


0.9 


Yes 


HRIFA01 2282a 


0.9 


Yes 


PLACE 10009 14 


F-PLACE1000S14 


0.94 


Yes 


HRIFA01 2283a 


0 94 


Yes 


PLACE1 000927 


F-PLACE1000S27 


0.71 


Yes 


HRJFA0 12290a 


0 71 


Y<>4 


PLACE 1000986 


F-PLACE 1000986 


051 


Ya«i 


HR1FA01 2333a 


051 


I ca 


pi Appinfti inn 


F-PLACE1001100 


0.76 


t e& 


MRtFAft19dl7A 
nrurnV I 1 » o 


n 7fi 


Vac 

i es 




F-PLACE100U83 


0.69 


YOS 




n rq 

If .Oil 


Yes 


di Ar*cmni090 


F-PLACE1001229 


0.65 


yes 


rirllr AU 1 /o I «>a 


n r«; 


Yes 




F-PLACE 1001 407 


0.83 


Yes 


n Kir Mu 1 ^VD^a 




Yes I 


KLAOtlUUl too 


F-PLACE1001788 


0.6 


Yes 


nnlrMU I ^oo I a 


n r 


Yes | 


PLACE 1 00Z374 


F-PLACE 1002374 


0.68 


Yes 


LlDirAA 1 AACK. 

rlKir AU ! 3200d 


n do 


Yes I 


PLACE 1 00Z51 8 


F-PLACE1002S18 l 


0.94 


Yes 


HRirAUloo49a 


0.78 


Yes 1 


PLACE! U03839 


F-PLACE 1003*39 \ 0.67 


Yes 


HKlFA014l/oa 


U.o 


Yes I 


PLACE! 003845 


F-PLACE 1 003*45 


0.92 


Yes 


HRIFA0 19185a 


0.92 


Yes | 


PLACE1004199 


F-PLACE1004199 


0.94 


Yes 


HRJFA0 14417a 


054 


Yes 1 


PLACE1 004282 


F-PLACE 10045:82 


0.94 


Yes 


HRIFA01 4467a 


0.94 


Yes 1 


1MP4AA11AC 

PLACE! 004305 


F-PLACE 1 004C105 


0.87 


Yes 


1 mil* k A < liAA 

HRIFAO 14482a 


0^7 


Yes 


PLACE 1 004637 


F-PLACE1004637 


0.89 


Yes 


HRIFAO 14692a 


039 


Yes 


PLACE! 005005 


F-PLACE 1005005 


055 


Yes 


HRIFA01 4953a 


0.55 


Yes ! 


PLACE1 005250 


F-PLACE1 005250 


052 


Yes 


HRIFA015129a 


0.52 


Yes 1 


PLACE1005410 


F-PLACE1 005410 


0.61 


Yes 


HRIFAO 15236a 


0.61 


Yes 


PLACE1 005725 


F-PLACE 1005725 


0.92 


Yes 


HRIFA0 1 5443a 


092 


Yes I 


PLACE 1005 768 


F-P LACE1 005768 


0.62 


Yes 


HRJFA01547la 


0.62 


Yes 


PLACE 1005927 


F-P LACE1005927 


0.66 


Yes 


HRIFAO 15568a 


0.94 


Yes 


PLACE1 006079 


F-P LACE 100607 9 


0.56 


Yes 


HRIFA01 5671a 


056 


Yes~! 


PLACE1 006093 


F-PLACEI006CI93 


0.59 


Yes 


HRIFA01 5682a 


0.59 


Yes 


PLACE1006219 


F-PLACE1006219 


0.94 


Yes 


HRIFA01 5764a 


0.93 


Yes I 


PLACE 1006809 


F-PLACE1006809 


0.66 


Yes 


HRIFA01 6129a 


0.66 


Yes j 


PLACE 1007040 


F-PLACE 1 007040 


0.87 


Yes 


HRIFA01 3288a 


0.87 1 Yes I 


PLACE 1007096 


F-PLACE1007096 


059 


Yes 


HRIFA012167a 


0.82 


Yes | 


PLACE 1007626 


F-PLACE 1007626 


0.67 


Yes 


HR1FA01 6623a 


0.67 


Yes 


PLACE1007971 


F-PLACE1007971 


074 


Yes 


HRIFA01 6838a 


0.74 


Yes 


PLACE1 008985 


F-PLACE1008985 


0.65 


Yes JHRIFA01 7457a 


0.48 


Yes 


PLACE 1009067 


F-PLACE 1009067 


0.59 


Yes iHRlFAOl 7509a 


0.59 


Yes 


PLACE1009196 


F-PLACE1009J96 


0.73 


Yes HRIFA01 7594a 


0.73 


Yes 


PLACE1 009527 


F-PLACE 1009527 


0.58 


Yes 


HRIFA017791a 


0.87 


Yes 


m i Arl AAAAA4 

PLACE1 009982 


F-PLACE1 009982 


0.94 


Yes 


HRIFA01 8075a 


0.94 


Yes 


PLACtlUl IZoo 


F-PLACE101i:!36 


0.52 


Yes 


jHRIFAOl 8827a 


0.66 


Yes 


PLAUb^WUZ 1 9 


F-PLACE2000219 


0.73 


Yes 


HRlFA034010a ! 0.73 


Yes 


jKNMU 1 UUUUU4 


F-SKNMC1000004 


0.94 


Yes 


HRIFA030097a 


0.94 


Yes 


ttjv on t nnoni ft 


F-THYR0 1000336 


0.83 


Yes 


HRIFA027754a 


0.83 


Yes 


TUI VDrv 1 fifWlG Q 

I M Y i uuuuy y 


F-THYRO1000D99 


0.94 


Yes 


HRlFA027803a 


0.94 


Yes 


IMYKU lUUliLO/ 


F-THYR0 100 1237 


0.94 


Yes 


HRlFA030248a 


0.94 


Yes 


7HYRO1001327 


F-THYR0 100 1327 


0.93 


Yes 


HRlFA025125a 


0.94 


Yes 


THYRO1001495 


F-THYRO! 001495 


0.89 


Yes 


HRIFA030394a 


0.89 


Yes 


THYR0 100 1523 


F-THYR0 100 1523 


0.71 


Yes 


HRIFA0304O8a 


0.71 


Yes 


THYRO1001725 


F-THYRO1001725 


0.94 


Yes 


HRIFA029107a 


0.94 


Yes 


Y79AA1 000226 


F-Y79AA 1000226 


0.94 


Yes 


HRIFA027874a 


0.94 


Yes 


Y79AA1000521 


F-Y79AA1 000621 


0.92 


Yes 


HRIFA027961a 


0.92 


|Yes 


Y79AA1 000775 


F-Y79AA1 000776 


0.78 


Yes 


HRIFA028401a 


0.78 


|Yes 


Y79AA1 000959 


F-Y79AA1000SI59 


0.9 


Yes 


HRIFA028465a 


0.9 


HYes 


Y79AA1001013 


F-Y79AA1O01013 


094 


\ Yes 


HRIFA01 1193a 


0.94 


1 Yes 


Y79AA1001264 


F-Y79AA1001;»64 


0.94 


\ Yes 


HRIFA028573a 


0.9< 


I Yes 


Y79AA1001328 


F-Y79AA100KI28 


091 


Yes 


HRIFA028592a 


0.91 


Yes 


Y79AA1001427 


F-Y79AA1001427 


065 


i Yes 


HRIFA028652a 


0.65 


i Yes 
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Table 10 



The 170 clones in which the selected cfones have the maximal ATGpii score 0 5 or Noher 



name of clone 


name of sequence 


maximal 


signal 


name of representative 


maximal 


signal 






ATGprl score 




sequence 


ATGpM score 




Y79AA 1001430 


F-Y79AA1001430 


0.94 


Yes 


HRIFA028654a 


0.94 


Yes 


Y79AA1001530 


F-Y79AA1001530 


0.94 


Yes 


HR1FA01 0206a 


0.94 


Yes 


Y79AA1001592 


F-Y79AA1 001592 


0.94 


Yes 


HRIFA028708a 


0.94 


Yes 


Y79AA1001793 


F-Y79AA1 001793 


0.89 


Yes 


HRIFA032066a 


0.89 


Yes 


Y79AA 100 1795 


F-Y79AA1001795 


0.59 


Yes 


HRIFA032067a 


0.59 


Yes 


Y79AA 100 1863 


F-Y79AA1001863 


0.56 


Yes 


HRIFA032097a 


0.15 


Yes 


Y79AA 1002022 


F-Y79AA1 002022 


0.94 


Yes 


HRIFA033718a 


0.94 


Yes j 


Y79AA 1002373 


F-Y79AA1 002373 


0.79 


Yes 


HR!FA032271a 


0.79 


Yes 
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Table 11 



The 35 clones in which the selected clones have the maximal ATGprl score 0.3 or higher 
and less than 0.5, and were predicted to contain the signal sequence by the PSORT 



name of clone 


name of sequent 


maximal 


signal 


representative 


maximal 


signal 






ATGpf 1 score 




sequence 


ATGpM score 




HEMBA1000907 


F-HEMBA1 000907 


0.39 


Yes 


HRIFA000845a 


0.94 


Yes 


HEMBA 1003602 


F-HEMBA1 003602 


0.43 


Yes 


HRIFA020109a 


0.43 


Yes 


HEMBA 1004797 


F-HEMBA1 004797 


0.45 


Yes 


HRIFA020883a 


0.66 


Yes 


HEMBB1 000447 


F-HEMBB 1000447 


0.31 


Yes 


HRIFA001558a 


0.76 


Yes 


MAMMA10D0591 


F-MAMMA1 00059 1 


0.34 


Yes 


HRlFA026303a 


0.77 


Yes 


MAMMA1Q00681 


F-MAMMA1 000681 


0.35 


Yes 


HRIFA026364a 


0.94 


Yes 


MAMMA1000986 


F-MAMMA1000986 


0.37 


Yes 


HRIFA021611a 


0.37 


Yes 


MAMMA 100 1893 


F-MAMMA1001893 


0.44 


Yes 


HRIFA027485a 


0.9 


Yes 


MAMMA1001957 


F-MAMMA1001957 


0.48 


Yes 


HRIFA027536a 


0.94 


Yes 


NT2RM20O1941 


F-NT2RM2001B41 


0.44 


Yes 


HRIFA032011a 


0.94 


Yes 


NT2RP1000050 


F-MT2RP 1000050 


0.47 


Yes 


HRIFA005102a 


0.54 


Yes 


NT2RP 1000903 


F-M72RP 1000903 


0.38 


Yes 


HRIFA005650a 


0.38 


Yes 


NT2RP2003469 


F-NT2RP2003469 


0.33 


Yes 


HRlFA008661a 


0.9 


Yes 


NT2RP2003664 


F-NT2RP2003664 


0.36 


Yes 


HRIFA008790a 


0.89 


Yes 


NT2RP2004447 


F-NT2RP2004447 


0.36 


Yes 


HRIFA009339a 


0.93 


Yes 


NT2RP2006042 


F-NT2RP2006042 


0.37 


Yes 


HRIFA0 10425a 


0.69 


Yes 


NT2RP3001195 


F-NT2RP3001 1 95 


0.44 


Yes 


HRIFA023227a 


0.94 


Yes 


NT2RP3003354 


F-NT2RP3003354 


0.3 


Yes 


HRIFA008212a 


0.51 


|Yes 


NT2RP3003469 


F-NT2RP3003^9 


0.3 


Yes 


HRIFA025143a 


0.3 


Yes 


NT2RP3003963 


F-NT2RP3003963 


044 


Yes 


HRIFA008949a 


0.62 


Yes 


NT2RP3004133 


F-NT2RP3004133 


0.35 


Yes 


HRIFA025706a 


0.94 


Yes 


NT2RP3004309 


F-NT2RP3004309 


0.4 


Yes 


HRIFA025778a 


0.92 


Yes 


OVARC1 000208 


F-OVARC 1000208 


0.4 


Yes 


HRIFA01 0942a 


0.4 


Yes 


PLACE1001536 


F-PLACE1001536 


0.33 


Yes 


HRIFA012761a 


0.31 


Yes 


PLACE 1003407 


F-PLACE 1003407 


0.48 


Yes 


HRIFA01 3899a 


0.48 


Yes 


PLACE1 003428 


F-PLACE 1003428 


0.37 


Yes 


HRIFA013911a 


0.61 


Yes 


PLACE1 003460 


F-PLACE 1003460 


0.38 


Yes 


HRIFA01 3932a 


0.94 


Yes 


PLACE 1005569 


F-PLACE1005!569 


0.32 


Yes 


HRIFA0l5351a 


0.68 


Yes 


PLACE1006277 


F-PLACE 1006277 


0.32 


Yes 


HRIFA01 5802a 


0.65 


Yes 


PLACE1010251 


F-PLACE1010251 ! 0.32 


Yes 


HRIFA01 8238a 


0.62 


Yes 


THYRO1000196 


F-THYR0 10001 96 


0.44 


Yes 


HRIFA029050a 


0.71 


Yes 


THYR0 1000795 


F-THYRO 1000795 


0.33 


Yes 


HRlFA029327a 


0.92 


Yes 


THYR0 1000999 


F-THYRO1000999 


0.4 


Yes 


HRIFA030203a 


0.4 


Yes 


THYRO1001478 


F-THYRO1001478 


0.47 


Yes 


HRIFA030385a 


0.89 


Yes 


Y79AA1 000426 


F-Y79AA1 000426 


0.471 Yes 


HRIFA027940a 


0.92 


Yes 
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Table 12 



and predicted to be containing the signal sequence by the PSORT 



name of done [ name of sequ ence 
HEMBA1 000300 1F-HEMBA 10003 00 



HEMBA1002164 



HEMBA1 002239 



HEMBA1 002421 



HEMBA 1003294 



HEMBA1 006572 



HEMBA1 007013 



HEMBB 1000567 



F-HEMBA 1002 164 



F-HEMBA1 002239 



F-HEMBA1 002421 



F-HEMBA 10032 94 



F-HEMBA1 0065 72 



maximal | signal [representative 



ATGprl score 

0.1 3| Yes 



HEMBB 1002663 



MAMMA1001043 



MAMMA 100 1284 



MAMMA1001901 



MAMMA 1002224 



NT2RM2000306 



NT2RM2000410 



NT2RP200O479 



NT2RP2001495 



NT2RP2001948 



NT2RP3000645 



NT2RP3003076 



NT2RP4001879 



NT2RP4002451 



QVARC 1000439 



OVARC 100 1727 



LACE 1002080 



PLACE1 002095 



PLACE 1004028 



F-HEMBB 10005 67 



F-HEMBB1 0026 63 



F-MAMMA10010 43 



F-MAMMA 10 0 1284 



F-MAMMA1Q01Q 01 



F-MAMMA 10022 24 
F-NT2RM20OQ3Q6 



F-NT2RM2000410 



F-NT2RP2000479 



F-NT2RP2001495 



F-NT2RP2001948 



F-NT2RP30006't5 



F-NT2RP3003076 



F-NT2RP40Q187 9 



F-NT2RP4002451 



F-QVARC 1000439 



F-QVARC 1001 7 27 
F-PLACE1 0020 30 



F-PLACE1 00209 5 



PLACE 1004482 



P LACE 1005383 



PLACE1 005544 



PLACE1 005660 



PLACE 1006443 



PLACE1 007296 



PLACE1 008469 



PLACE 1008984 



PLACE4000455 



SKNMC 10000 14 



F-PLACE 100402 8 



F-PLACE1 0044 32 



F-PLACE10053 B3 



F-PLACE1 0055 44 



F-PLACE1 0056 60 



F = PLACE1006443 



F-PLACE1 0072 96 



F-PLACE 100846 9 



F-PLACE1 0089 84 



THYR0 100 1702 



Y79AA1Q00270 



Y79AA1 001056 



Y79AA1001803 



F-PLACE40Q04 55 



F-SKNMC10000 14 



F-THYR0 1001 702 



F-Y79AA1 00027 0 
F-Y79AA1 001 0156 



F-Y79AA1001803 



0.06 



F-HEMBA1007Q 13 I 0.19 



0.09 



0.29 



0.17 



Yes 



Yes 



Yes 



Yes 



026 



Yes 



Yes 



0.17|Yes 



0.13 



0.25 



0.22 



0.24 



0.19 



0.29 



0.2 



0.23 



0.26 



0.11 



0.15 



0.22 



0.1 



0.23 



Yes 



Yes 



Yes 



Yes 



Yes 



Yes 



Yes 



Yes 



Yes 



Yes 



Yes 



Yes 



Yes 



0.12 



0.13 



0.11 



Yes 



Yes 



Yes 



Yes 



0.08 



0.2 



0.27 



0.22 
0.27 



0.11 



0.23 



0.15 



0.1 



0.21 



0.27 



Yes 



Yes 



Yes 



Yes 



[sequence 
[hR1FA0002847 



Yes 



HRIFA0Q1972a 



maximal 



[signal 



ATGp/1 score 



HRIFA020163a 



HRlFA021543a 



HRIFA02 1906a 



HR1FA029730 ; 



HRIFA031871a 



HRlFA026764a 



HRIFA026889a 



HRIFA027493a 



HRIFA027717a 



HRIFA021985a 



HRIFA022055 a 



HR1FA000822a 



HRlFA007228a 



HRIFA007565a 



HRIFAQ22890 a 



HRlFA024978a 



HRIFA01 7818a 



HR1FA01 8447a 



HRIFA01 1105a 



HR1FA01 9960a 



HR1FA01 3092a 



HRIFA013103a 



HRIFAO 14303a 



HR1FAQ 1459 0a 



HRIFA0 152 19 



a 



HRIFA009852a 
HRIFA01 5409a 



HRIFA01 5902a 



Yes 



Yes 



Yes 



Yes 



Yes 



Yes 



HRIFA01 6430a 



HRIFA01 7146a 



HRIFA01 7456a 



HRIFA01 2333a 



HR1FA030106a 



HRIFA030511a 



HRIFA005644a 



HRIFA028497a 



HR!FA032073a 




0.15 
0.62 



0.82 



0.15 



029 



Yes 



Yes 



Yes 



Yes 



0.81 



026 



0.17 



0.13 



0.25 



0.82 



0.12 



0.78 



0.89 



0.91 



0.65 



0.79 



0.34 



0.21 



0.22 
0.76 



0.61 



0.12 



0.57 



0.52 



Yes 



Yes 



Yes 



Yes 



Yes 



Yes 



Yes 



Yes 



Yes 



Yes 



Yes 



Yes 



Yes 



Yes 



Yes 



Yes 



Yes 



Yes 



Yes 



Yes 



0.41 



Yes 



Yes 



0.2 



0.89 
027 



0.94 



0.74 



0.51 



0.76 



0.8 



0.63 



0.27 



Yes 



Yes 



Yes 



Yes 



Yes 



Yes 



Yes 



Yes 



Yes 



Yes 



66 



EP1 130 094 A2 



Table 13 



Four clones from which selected clones have the maximal ATGprl score 0.5 or higher, 
and predicted to be lacking the signal sequence by the PSORT 



name of done 


name of sequence r 


Taxi mat 


signal 


name of representative 


maximal 


signal 






TGpn score 




sequence 


ATGprl score 




NT2RP3002281 


F-NT2RP3002281 


0.81 


No 


HR!FA012999a 


0.61 


Yes 


NT2RP3002721 


F-NT2RP3002721 


0.94 


No 


HRIFA023305a 


0.57 


Yes 


NT2RP3004083 


F-NT2RP30040B3 


0.94 


No 


HRIFA008387a 


0.76 


Yes 


PLACE1 005669 


F-PLACE 1005669 


0.941 No 


HRIFA012513a 


0.65 


Yes 



Table 14 



Two clones from which selected clones have the maximal ATGprl score 0.3 or higher and less 
than 0.5 and predicted to have no signal sequence by the PSORT 



name of clone 


name of sequence 


maximal 


signal 


representative 


maximal [signal 






ATGprl score 




sequence 


ATGprl score 




NT2RP3000481 


F-NT2RP3000481 


0.47 


No 


HRIFA028614a 


0.93 


Yes 


NT2RP3003559 


F-NT2RP3Q03559 


0.48 


No 


HRIFA025514a 


0.45 


Yes 



Table 15 

Two clones from which selected clones have the maximal ATGprl score 0 or higher and less than 
0.3, and predicted to have no signal sequence by the PSORT 



name of clone 


name of sequence 


maximal 


signal 


[representative j 


maximal 


signal 






VTGpM score 




sequence 


fcTGprl score 




PLACE 1005601 


F-PLACE 1005601 


0.12 


No 


HRIFA01 0593a 


0.64 


Yes 


PLACE1 006786 


F-PLACE1 006786 


0.22 


No 


HRIFA01 2333a 


0.51 


Yes 



EXAMPLE 12 

A method for the selection of clones based on the top hit data in the homology search against the SwissProt (a method 
for the selection of a novel full-length protein that is predicted to have a function based on the top hit data). 

[01 76] The representative sequences of the clusters were discarded in which the 5'-end sequence is identical (90% 
or more matching), or not similar (the compared part contains a sequence of 25 nucleotides or shorter and the similarity 
is lower than 60%) to the top hit data in the SwissProt. Then, the remaining representative sequences which has 
similarity to the representative sequences of the 5'-ends were classified by a group of the above keywords (some 
representative sequences belong to a group by multiple keywords), and then clones were selected from the clusters. 
The names and the corresponding SEQ IDs of the representative sequences, and also the names of the introns (in- 
cluding the representative sequence of the 5'-end or the 3'-end, or ESTs) used for selecting the clones from the rep- 
resentative sequences and the corresponding SEQ IDs are shown in the last part of the present specification (Table 
368 and 369, respectively). HRIFA indicates the representative sequence of the 5'-end group, and HRIRA indicates 
the representative sequence of the 3'-end group. 

[0177] In principle, from the clusters containing only a single clone (the 5'-end sequence is the representative se- 
quence of the cluster), the clone was selected. However, in the following cases, the clone containing a longer 5'-end 
was selected: where the maximal ATGprl score was less than 0.5, the 5'-end sequence of the clone to be selected 
was not complete, or the 5'-end of the clone was found to be obviously short nevertheless the clone should not be 
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included in the same cluster based on the BUST analysis between the other clones, or further, the 5'-end sequence 
of the said clone, winch corresponds to the 3'-ends of the other clones belonging to the same cluster in which the 3'- 
end of the said clone was included, was turn out to be longer than those of the other clones by assembling them When 
here were two clones in the same cluster, judged to be full-length, and it was difficult to determine which clone has 
the longer 5 -end even by assembling them, all the clones were selected. As a result, the ATGprl score in some clones 
became less than 0.5 or less than 0.3. The fullness ratio of these clones became lower, but there is still a possibility 
that the clones are full-length. y 

[0178] In the case in which multiple clones were contained in a cluster, selected was the clone having the longest 
5 -end m the clones judged to be full-length compared to the human mRNA or human EST. However in the following 
cases, the clone containing a longer 5'-end was selected: where the maximal ATGprl score was less than 0 5 the 5* 
end sequence of the clone to be selected was not complete, or the 5'-end of the clone was found to be obviously short 
nevertheless the clone should not be included in the same cluster based on the BLAST analysis between the other 
clones, or further, the 5'-end sequence of the said clone, which corresponds to the 3'-ends of the other clones belonging 
to the same cluster in which the 3'-end of the said clone was included, was turn out to be longer than those of the other 
clones by assembling them. When there were two clones in the same cluster, judged to be full-length, and it was difficult 
o determine which clone has the longer 5'-end even by assembling them, all the clones were selected. As a result 

n! ! ? SC ° re m S ° me C, ° neS b8Came leSS than 0 5 or less tnan °' 3 - These c,ones can still be full-length 
[0179] Based on the top hit data in the SwissProt homology search : 658 clones were selected. Among them 446 
clones were selected by the keywords, secretion or membrane. Using the keyword, glycoprotein, 243 clones were 
selected. 51 clones were selected by the keywords for signal transduction. With the keywords for transcription 130 
clones were selected. 1 7 clones were selected by the keywords for disease. 

[0180] Among the 446 clones selected by the keywords, secretion or membrane, 77 clones were overlapped with 
those selected by combining the ATGprl score and prediction by the PSORT for the existence of a signal sequence 
Also many clones were overlapped with those selected by the keyword, glycoprotein. Moreover, some clones were 
overlapped with the clones selected by the keywords for diseases. 

[0181] Among the 243 clones selected by the keyword, glycoprotein, 53 clones were overlapped with those selected 
by combining the ATGprl score and prediction by the PSORT for the existence of a signal sequence. Also, many clones 
were ovenapped wrth those selected by the keywords, secretion or membrane. Moreover, some clones were overlapped 
with the clones selected by the keywords in diseases. 

[0182] Among the clones selected by the top hit data in the homology search on the SwissProt, 532 clones were 
having the maximal ATGprl score 0.5 or higher. 59 clones were having the maximal score 0.3 or higher and less than 
0.5. 67 clones were with the maximal score less than 0.3. 

[01 83] When the maximal ATGprl score is 0.5 or higher, 0.3 or higher, no less than 0, the expected fullness ratio is 
as shown in Table 3, 4, 5, and 6, respectively. 



Table 1 6 

The representative sequences of the most homologous sequences 
factor", "cytokine", or "hormone", and the selected clones 



in the SwissProt with the keyword(s) "growth 



name of clone 



HEMBA1001563 

HEMBA1 003047 

HEMBA1 005070 

HEMBA1 006724 

HEMBA1006916 

MAMMA1001066 

MAM MA1 001 634 

MAMMA1002165 

NT2RM4000326 

NT2RM4001377 

NT2RP2000447 

NT2RP2000663 

NT2RP2000903 

NT2RP2002974 

NT2RP2003369 

NT2RP2004141 



name of representative sequence 



HRIFA001439a 

HRIFA002743a 

HRIFA020144a 

HRIFA021620a 

HRIFA021855a 

HRIFA027355a 

HRIFA027187a 

HRIFA027673a 

HRIFA032530a 

HRIFA005300a 

HRIFA006448a 

HRIFA006609a 

HRIFA006798a 

HRIFA027860a 

HRIFA008596a 

HRIFA009123a 
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Table 16 


(continued) 




The representative sequences of the most homologous sequences in the Swiss Prot with the keyword(s) "growth 
factor", "cytokine", or "hormone", and the selected clones. 


5 


name of clone 


name of reDresentative seauence 






HRIFAOIOlfila 






HRIFAO 10466a 




NT2RP3000645 


HRIFA022890a 


10 


NT2RP3000838 


HRIFA005300a 


NT2RP4002451 


HRIFA01 8447a 




OVARC1 000275 


HRIFA0 10988a 




OVARC1001030 


HRIFA021061a 




PLACE 1004492 


HR!FA014598a 


15 


PLACE 1009279 


HRIFA017643a 




THYRO1001071 


HRIFA029440a 




Y79AA1 000207 


HRIFA027867a 




Y79AA1 000426 


HRIFA027940a 


20 


Table 17 


25 


The representative sequences of the most homologous sequences in the Swiss Prot with the keyword(s) "receptor", 
"G-protein coupled receptor", "ionic channel", "voltage-gated channel", or "calcium channel", and the selected 
clones. 




narnp nf rlone 


name of reoresentative seouence 




BNGH41 000091 


HRIFA029511a 




HEMBA1001621 


HRIFA001489a 


30 


HEMBA1 003392 


HRIFA002980a 




HEMBA1 005545 


HRIFA020272a 




HEMBA1 007291 


HRIFA022462a 




HEMBA1 007332 


HRIFA022493a 


35 


MAMMA1 000681 


HRIFA026364a 


MAMMA1 000706 


HRIFA026382a 




MAMMA 1001978 


HRIFA027549a 




NT2RM2001939 


HRIFA032009a 




NT2RM2001941 


HRIFA032011a 


40 


NT2RM4002352 


HRIFA001337a 




NT2RP2002510 


HRIFA0079B5a 




NT2RP2002533 


HRIFA008000a 




NT2RP2005181 


HRIFA005409a 


45 


NT2RP3000304 


HRIFA022616a 


NT2RP3001542 


HRIFA028501a 




NT2RP3002409 


HRIFA024197a 




NT2RP3002836 


HRIFA024392a 




NT2RP3003000 


HRIFA024767a 


50 


NT2RP3004552 


HRIFA025904a 




NT2RP4001877 


HRiFA032433a 




NT2RP4002750 


HRIFA028157a 




OVARC1 000090 


HRIFA01 0859a 




OVARC1 000956 


HRIFA01 1484a 


55 


OVARC1001991 


HRIFA01 9498a 




PLACE1001016 


HRIFA012354a 




PLACE1001340 


HRIFA01 2584a 
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Table 17 (continued) 



The representative sequences of the most homologous sequences in the SwissProt with the keyword(s) "receptor- 
cfone's °° reCeP ' 0r "' "' 0niC Channer> * Vo,,a 9 e -9 a,ed channel ". ° r channel", and the selected ' 



name of clone 
PLACE1001401 
PLACE 1001564 
PLACE1001655 
PLACE 1002547 
PLACE 1002967 
PLACE 1003573 
PLACE 1003852 
PLACE 1004441 
PLACE1005031 
PLACE1 005878 
PLACE1007296 
PLACE 1008469 
PLACE1010784 
PLACE1010968 
THYR01 000956 
Y79AA 1001 062 



name of representative sequence 



HRIFA01 2625a 
HRIFA01 2737a 
HRIFA01 2795a 
HRIFA01 3376a 
HRIFA01 3620a 
HRIFA01 4006a 
HRIFA014l85a 
HRIFA014561a 
HRIFA01 4967a 
HRIFA01 5536a 
HRIFA01 6430a 
HRIFA017146a 
HRIFA031126a 
HRIFA01 8666a 
HRIFA029393a 
HRIFA023434a 



Table 18 



The representative sequences of the most homologous sequenses in the SwissProt with the keyword(s) 
extracellular matrix", and the selected clones. y w 



name of clone 


[ name of representative 


HEMBA1 000006 


HRIFA027327a 


HE MBA 1000275 


HRIFA000264a 


HE MBA 1000835 


HRIFA000776a 


HEMBA1 000907 


HRIFA000845a 


HEMBA1002164 


HRIFA001972a 


HEMBA1003101 


HRIFA002787a 


HEMBA1 003230 


HRIFA002891a 


MAMMA1 000403 


HRIFA026465a 


MAMMA1001615 


HRIFA022865a 


MAMMA1001893 


HRIFA027485a 


MAMMA1002128 


HRIFA027644a 


NT2RM4000284 


HRIFA032506a 


NT2RM4000295 


HRIFA032511a 


NT2RM4000587 


HRIFA032696a 


NT2RP2000616 


HRIFA006572a 


NT2RP2000694 


HRIFA006633a 


NT2RP2001562 


HRIFA01 0799a 


NT2RP2001948 


HRIFA007565a 


NT2RP2002409 


HRIFA007909a 


NT2RP2004447 


HRIFA009339a 


NT2RP2004847 


HRIFA005944a 


NT2RP2006004 


HRIFA002766a 


NT2RP3000059 


HRIFA022203a 


NT2RP3000616 


HRIFA022875a 
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Table 1 8 (continued) 



10 



20 



30 



The representative sequences of the most homologous sequenses in the SwissProt with the keyword(s) 


"extracellular matrix", and the selected clones. 




name of clone 


name of representative sequence 


NT2RP3000871 


HRIFA023048a 


NT2RP3000921 


HRIFA023069a 


NT2RP3002015 


HRIFA01 5995a 


NT2RP3002448 


HRIFA024218a 


NT2RP3002983 


HRIFA024473a 


NT2RP3003729 


HRIFA025488a 


NT2RP3004067 


HRIFA027327a [ 


OVARC1001049 


HRIFA022702a 


OVARC1001222 


HRIFA022714a 


OVARC 1002058 


HRIFA022737a 


PLACE1001114 


HRIFA01 2427a 


PLACE1 002329 


HRIFA01 3235a 


PLACE1004816 


HRIFA014819a 


PLACE1 005383 


HRIFA015219a 


PLACE1 005569 


HRIFA015351a 


PLACE1 006073 


HRIFA003402a 


PLACE1 006277 


HRIFA01 5802a 


PLACE1 008984 


HRIFA01 7456a 


THYR01001102 


HRIFA008174a 


THYRO1001471 


HRIFA030381a 


THYRO1001478 


HRIFA030385a 


Y79AA1 000888 


HRIFA028440a 



Table 1 9 



The representative sequences of the most homologous sequences in the SwissProt with the keyword(s) 


"transmembrane", and the selected clones (except overlapped with Table 16, 17, and 18) 


name of clone 


name of representative sequence 


BNGH41 000020 


HRIFA030662a 


HEMBA1000121 


HRIFA000116a 


HEMBA1 000349 


HRIFA000327a 


HEMBA1 000477 


HRIFA000446a 


HEMBA1 000940 


HRIFA002384a 


HEMBA1002163 


HRIFA001971a 


HEMBA1 002421 


HRIFA005392a 


HEMBA1 002767 


HRIFA002503a 


HEMBA1 003945 


HRIFA009220a 


HEMBA1 004250 


HRIFA003504a 


HEMBA1 004391 


HRIFA020693a 


HEMBA1 004444 


HRIFA029285a 


HEMBA1 004454 


HRIFA003592a 


HEMBA1 004505 


HRIFA003635a 


HEMBA1 004797 


HRIFA020883a 


HEMBA1 004982 


HRIFA003892a 


HEMBA1 005489 


HRlFA024543a 


HEMBA1005698 


HRIFA020453a 


HEMBA1 005945 


HRIFA020349a 
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The representative sequences of the most 
"transmembrane", and the selected clones 

name of clone 

HEMBA1 006299 
HEMBA1 006430 
HEMBA1 006482 
H EM BA 1007241 
HEMBB1 000679 
HEMBB1001200 
HEMBB1001573 
HEMBB1 002427 
MAMMA1 000204 
MAM MA 1000473 
MAMMA1000196 
MAMMA1 000788 
MAMMA1000814 
MAMMA1001237 
MAMMA1001418 
MAMMA1 002091 
MAMMA1 002095 
MAMMA1 002586 
NT2RM 1000580 
NT2RM 1000855 
NT2RM 1000858 
NT2RM 1000899 
NT2RM2000565 
NT2RM2000582 
NT2RM2001126 
NT2RM4000198 
NT2RM4000417 
NT2RM4000444 
NT2RM4000593 
NT2RM4000761 
NT2RM4000965 
NT2RM4001735 
NT2RP1000181 
NT2RP1 000261 
NT2RP1 000300 
NT2RP1 000325 
NT2RP1 000551 
NT2RP1 000981 
NT2RP2000533 
IMT2RP2000649 
NT2RP2000818 
NT2RP2001200 
NT2RP2001495 
NT2RP2001514 
NT2RP2001956 
NT2RP2002063 
NT2RP2002232 



Table 19 (continued) 



homologous sequences in the SwissProt with the keyword(s) 
(except overlapped with Table 16, 17, and 18) 



name of representative sequence 
HRIFA025771a 
HRIFA021213a 
HRIFA022328a 
HRIFA022423a 
HRIFA029802a 
HRIFA030839a 
HRIFA031091a 
HRIFA005760a 
HRIFA025966a 
HRIFA01 8870a 
HRIFA026242a 
HRIFA01 8287a 
HRIFA026618a 
HRIFA026860a 
HRIFA027045a 
HRIFA027622a 
HRIFA027625a 
HR!FA027012a 
HRIFA004523a 
HRIFA004696a 
HRIFA004714a 
HRIFA004745a 
HRIFA022139a 
HRIFA021787a 
HRIFA024088a 
HRIFA032453a 
HRIFA032587a 
HRIFA032605a 
HRIFA023489a 
HRIFA023923a 
HRIFA030103a 
HRIFA002063a 
HRIFA005184a 
HRIFA005231a 
HRIFA005255a 
HRIFA005271a 
HRIFA005420a 
HRIFA005702a 
HRIFA006510a 
HRIFA006596a 
HRIFA006730a 
HRIFA007013a 
HRIFA007228a 
HRIFA007243a 
HRIFA007571a 
HRIFA007659a 
HRIFA009783a 
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Table 19 


(continued) 


The representative sequences of the most homologous sequences in the SwissProt with the keyword(s) 


"transmembrane", and the selected clones (except overlapped with Table 16, 17, and 18) 


namp nf clonp 




in I £nrcUUiis&/ 


l-l R 1 P A rt 9 7 R 7 a 
nnlrMUtJ /Of a 


NITPRPP0.0PQ4? 
in i tnrtuutvJHt 


HRIPAfififtPflAa 


MT?RPPf)0?Q7fi 
in i tnrtUwc3 / u 


HRIPArtrtft^lAo 




MRIPAOOA/lRQa 


IN 1 tnrcUU003U 


L-iRIPAnrtQC-M a 
nnlrMUUoDI 1 a 


in i tnr tuuo*toy 


WRIPArtrtQ£R1 a 
n ill rMUUDDD i a 


IN 1 ^nr<:UUODjO 


M R 1 P A C\ Pk&yflA q 

n rii rAUUo / o*f a 


IN 1 LnrLUUODQt 


HRiPAnnR7ona 
nnirnuuo/yud 


IN 1 tnrcUU'JtUJ 


MRIPArtrtOI 71 q 
nrilrAUUy I / I a 


IN 1 tnrtUUt/W 


MRIPAnrtQ^7Ra 

n n i rrtuuyo / oa 


IN 1 tnrtUUO*ttO 


MRIPAH1 nnnKa 
nnirAU i uuuoa 


In 1 ^rln^UUOOy/ 


UIDIPArt-f rt1 Orto 

nrtlrAU lU i oUa 


In 1 cnrtUUODot 


l-l PICA rt1 (HKOq 
nnlrMUl U I O^d 


IN 1 tnrtUUOaaH 


MRIPAH1 rtOQ/la 


IN 1 tnrtUUDtDif 


nrilrAU^oy i oa 


MTPRPPDrtfiCil 9 
In 1 tnrtUUDO l£ 


WRIPArt0^1QQ7'a 

n n i rAU*i4y o / a 


In 1 ^riroUUU 1 <:j 


nnlrAU*i^o4a 


In 1 tnruUUU 1 f 1 


UIOIPArtrt7"700o 

MrilrAUU/ / dd.<i 


In 1 tnrOUUUO/D 


n ri 1 r AU<£bo / oa 


In 1 cnrOUUUyU/ 


nrilrAU^OU4ba 


in I dnroUU I UD I 


UIDIPArtOQi Kvlo 
MrilrAU<io I 34a 


NT9RP<*rtrtH 7rt 
IN I tnroUU I I / U 


UIDIPArti rtrt7Do 

nrilrAU l UU/oa 


In I triroUU 1 1 yo 


Ul DIPA rtO*3007<5 

n ri 1 r Pssjdodd f a 


M~ro pdq nrt -i o/ rt 
In 1 ^riroUU 1^4U 


n M 1 r AU^o / a 


In I tnroUU l>3^ 


nnlrAU*:ooU4a 


IN I tnrOUU I OOO 


nrilrAUUby^ba 


In I tnrOUU I SOU 


UIDIPA rtQrt.CO.Qo 

n n 1 rAUoUoy y a 


in I tnrouu I / *3o 


n n 1 rAU^O / bba 


MTORPQftnOi Art 
In I £nroUUtlDU 


n HI rAUUo /bOa 


MTODPonnoQOyi 
In l^nroUU/ioc4 


nnlrAU<£4oo4a 


IN 1 tnroUUiO'lc 


(-IDIPArtrtRCQCo 

MnlrAUUbooba 


In I *:riroUU^O/ 1 


nrilrAU*:4^:ooa 


In 1 tnrOUUtyUU 


Ul DICAftO/lilOQo 

n ri 1 r AU^44^o3 


IN 1 ^li"OUU*iyoo 


Ul Dl PArt i 7C7rtn 

nrilrAU 1 /b/ua 


IN 1 ^nrjUUJOJt 


nnirAU io4/ /a 


IN 1 (inrJUUoysJy 


Ul Dl CA rtOCCOCo 

n n! r AU^oboba 


IN l *:nroUU4 I oU 


Ul DIPArtOR7rt<Jo 

n ri 1 r A U*io / Uoa 


NT2RP3004133 


HRIFA025706a 


NT2RP3004294 


HRIFA025771a 


NT2RP3004406 


HRIFA025800a 


NT2RP3004481 


HRIFA026089a 


NT2RP3004625 


HRIFA026564a 


NT2RP3004647 


HRIFA026576a 


NT2RP4001009 


HRIFA022227a 


NT2RP4001879 


HRIFA017818a 


OVARC1 000003 


HRIFAO 10790a 


OVARC1000105 


HRIFA007493a 
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Table 1 9 (continued) 



The representative sequences of the most 
"transmembrane", and the selected clones 

name of clone 
OVARC1 000137 
OVARC1 000307 
OVARC1 000331 
OVARC1 000553 
OVARC1 000873 
OVARC1001163 
OVARC1001260 
OVARC1001336 
OVARC1001607 
PLACE1 000740 
PLACE1 001123 
PLACE1 001231 
PLACE1001836 
PLACE1001949 
PLACE1 002095 
PLACE1 002905 
PLACE1 002911 
PLACE1003163 
PLACE1 003644 
PLACE1 003737 
PLACE1 004279 
PLACE1 004450 
PLACE1 004482 
PLACE1 004630 
PLACE1 005544 
PLACE1 005745 
PLACE1 005927 
PLACE1 006290 
PLACE1 007096 
PLACE1 007845 
PLACE1 007881 
PLACE1 008359 
PLACE1008716 
PLACE 1008985 
PLACE1 009600 
PLACE1010011 
PLACE1 010078 
PLACE1010251 
PLACE1010445 
PLACE1010827 
PLACE1011045 
PLACE1011181 
PLACE1011236 
PLACE1011516 
PLACE3000181 
SKNMC1 000082 
THYR0 10001 96 



homologous sequences in the SwissProt with the keyword(s) 
(except overlapped with Table 16,17, and 18) 

name of representative sequence 
HRIFA010891a 
HRIFA002919a 
HRIFA004162a 
HRIFA011197a 
HRIFA032478a 
HRIFA032079a 
HRIFA01 9867a 
HRIFA01 9867a 
HRIFA022729a 
HRIFA012151a 
HRIFA012436a 
HRIFA012515a 
HRIFA012914a 
HRIFA01 2990a 
HRIFA013103a 
HRIFA013586a 
HRIFA01 3589a 
HRIFA01 3744a 
HRIFA01 4056a 
HRIFA014111a 
HRIFA014465a 
HRIFA01 4568a 
HRIFA014590a 
HRIFA014688a 
HRIFA009852a 
HRIFA01 7855a 
HRIFA01 5568a 
HRIFA015811a 
HRIFA012167a 
HRIFA01 6758a 
HRIFA01 6240a 
HRIFA015547a 
HRIFA01 7295a 
HRIFA01 7457a 
HRIFA01 7836a 
HRIFA01 8092a 
HRIFA018131a 
HRIFA01 8238a 
HRIFA01 0736a 
HRIFA01 8580a 
HRIFA014500a 
HRIFA01 8794a 
HRIFA01 8827a 
HRIFA01 8993a 
HRIFA004112a 
HRIFA030147a 
HRIFA029050a 
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Table 19 (continued) 



The representative sequences of the most homologous sequences in the SwissProt with the keyword(s) 


"transmembrane", and the selected clones (except overlapped with Table 16, 17, and 18) 


norriP nf o lonf* 

1 ral 1 IC SJl L>IUI 1C7 


nflmp nf rpnrp^pntativp QpniiAnnA 

1 lal 1 IC w 1 1 CUICoCI ILallVc OCVJUCI 


tu yro 1 nnrvLnn 
i n t nu i uuu*fuu 


fl nlrMUUUOD'+d 




H R 1 PA n^Q^HQa 


ini nu i uuud/ o 


HRIPAf19Q9^Ra 


THYR01 nnn77R 


WRIFAH^Q^I 7a 
nmrMU^yo i / a 


thvrhi nnn7Qc; 
i n t nu i uuu / yo 


W R 1 FA OQQQ97a 


i n i nu i uuuouu 


WRIFAH9771 Aa 


I n T r\\J 1 KJU 1 II O 


nnir/w^y*+oua 


THYR01 nfl11 9ft 
i n t nu i uu 1 1 c.o 


HRIFAn9Q/lfi7a 


THYRH1 nrji OAO 
i n t nu i i z.'+c. 


MR I FA n^O^fiHa 


THYR0 1001 266 


HRIFA030264a 


THYRO1001456 


HRIFA030370a 


THYRO1001529 


HRIFA030411a 


THYRO1001702 


HRIFA030511a 


Y79AA1000127 


HRIFA026121a 


Y79AA1 000270 


HRIFA005644a 


Y79AA1001426 


HRIFA028651a 


Y79AA1001787 


HRIFA028790a 


Y79AA1001799 


HRIFA032070a 


Y79AA1002213 


HRIFA032224a 



Table 20 



35 



The representative sequences of the most homologous sequences in the SwissProt with the keyword(s) "signal", 


and the selected clones (except overlapped with Table 16, 17, 18 and 19) 


name of clone 


name of representative sequence 


BNGH41 000087 


HRIFA029508a 


HEMBA1000128 


HRIFA000123a 


HEMBA1000443 


HRIFA000415a 


HEMBA1 000590 


HRIFA000553a 


HEMBA1 000634 


HRIFA004780a 


HEMBA1 000745 


HRIFA000695a 


HEMBA1001221 


HRIFA001132a 


HEMBA1001228 


HRIFA001138a 


HEMBA1001390 


HRIFA000071a 


HEMBA1002131 


HRIFA001942a 


HEMBA1002167 


HRIFA00 1975a 


HEMBA1002178 


HRIFA00 1984a 


HEMBA1 002524 


HRIFA002284a 


HEMBA1 002992 


HRIFA002694a 


HEMBA1 003072 


HRIFA002762a 


HEMBA1 003315 


HRIFA000016a 


HEMBA1 003487 


HRIFA003055a 


HEMBA1 003530 


HRIFA003093a 


HEMBA1005145 


HRIFA003946a 


HEMBA1 005337 


HRIFA019651a 


HEMBA1 005449 


HRIFA020707a 


HEMBA1 005522 


HRIFA021398a 
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Table 20 (continued) 



| The representative sequences of the most homologous 


sequences in the SwissProt with the kevword(s) "sianal 0 


I ancJ tne selected clones (except overlapped with Table 16, 17, 18 and 19) 


name of clone 


name of representative sequence 


HEMBA1 006335 


HRIFA01 2069a 


HEMBA1 006572 


HRIFA021543a 


HEMBA1 006707 


HRIFA021499a 


HEMBA1 006749 


HRIFA021637a 


HEMBA1 006902 


HRIFA021754a 


HEMBA1007013 


HRIFA021906a 


HEMBA1 007057 


HRIFA022985a 


HEMBB1 000447 


HRIFA001558a 


HEMBB1 000567 


HRIFA029730a 


HEMBB1000881 


HRIFA029932a 


HEMBB1001026 


HRIFA030025a 


HEMBB1001048 


HRIFA030045a 


HEMBB1001847 


HRIFA031249a 


| MAMMA1000106 


HRIFA024482a 


MAMMA1 000226 


HRIFA025978a 


MAM MA1 000591 


HRIFA026303a 


MAMMA1001043 


HRIFA026764a 


j MAMMA1001957 


HRIFA027536a 


j MAMMA1 002080 


HRIFA01 6963a 


M AM MA1 002234 


HRIFA027722a 


j MAMMA1 002633 


HRIFA030461a 


MAMMA1003126 


HRIFA029263a 


NT2RM 1000462 


HRIFA004426a 


NT2RM 1000542 


HRIFA004490a 


NT2RM2000410 


HRIFA022055a 


NT2RM2000423 


HRIFA022065a 


NT2RM2000622 


HRIFA022156a 


NT2RM2000773 


HRIFA023894a 


NT2RM2001626 


HRIFA028911a 


j NT2RM2001818 


HRIFA03 1935a 


NT2RM4000648 


HRIFA032730a 


NT2RM4001843 


HRIFA024718a 


| NT2RP1 000050 


HRIFA005102a 


NT2RP1001004 


HRIFA005720a 


NT2RP2000394 


HRIFA003640a 


NT2RP2000514 


HRIFA006494a 


NT2RP2001480 


HRIFA007219a 


j NT2RP2001755 


HHIFA007424a 


I NT2RP2001878 


HRIFA007512a 


NT2RP2002188 


HRIFA007745a 


NT2RP2002564 


HRIFA007244a 


NT2RP2002824 


HRIFA008200a 


NT2RP2003042 


HRIFA008362a 


! NT2RP2003593 


HRIFA008252a 


NT2RP2003931 


HRIFA008976a 


NT2RP2004606 


HRIFA009451a 


NT2RP2004648 


HRIFA009482a 
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Table 20 


(continued) 


The representative sequences of the most homologous sequences in the SwissProt with the keyword(s) "signal", 


and the selected clones (except overlapped with Table 16, 17, 18 and 19) 


namp nf Hont* 


name of reoresentative seouencp 


NTPRPpnnm ri 

IM 1 c. \\ \ c. \J yj ZJ \ DO 


HRIFA0nQR?Ra 




HRIFAOOQflftla 


NT? R P?nnR'V7R 


HRIFA004Q1 Qa 


NT? R P?nnRR4 1 


nnirnu I uvjijua 


NTPRPOnnRRftQ 


HRIFAm n*^1 Oa 
nnirnu I UO I ija 


NT?RP?nnfin4? 


HRIFAm n49Ra 


IN i till JUUUUOw 


HRIFAn??R?fta 


mtp R P^nnnd^R 


HRIFAn??77fia 


NT? R P ^ nn CiAAA 


HRIFAn??7ft?a 


111 1 t. tl 1 \J VJvJ UHO 1 


HRIFAn?RR14.a 


NT?RPQnnn791 
IN 1 tnrOUUUf £ 1 


HRf FAnnQft?Ra 


NTORP^nm m ? 


WRIFAnOQI OQa 

nriirAU^o i<iyd 


MT9RP9Hni 1 RQ 

IN I cnrOUU I I OcJ 


WRIFAnO^I 9a 


MTPRP^nni rq? 


HRIFAn9^4R4.Q 


MTORP^nrH 7R4 


HRIFAnn779fta 


NT? R P^nnp^ 1 1 

im i tnrOuUcO I I 


HRIFAn?A71 Ra 




HRICAnOn74Qa 
nrfirAU^U/ ^od 


MT9RPQnfl97Qn 
IM 1 tnr JUUt / yu 


LIRIFAnOR^I Qq 


MT? R PQnn?ftft 7 
IM 1 cnrOuUcuO/ 


n r» i rMUtyt / oa 


N~n> r P *3 nn <V3 


MRIFAnnfl91 Oq 

nnirMUUo^ l ^a 


nt? r p ^ nn ft 


l-l R 1 F A n9 R47Q a 


NT?RP^nni47'* 


HRIFAnm.41 

n n i rMUU i *i i oa 


MT9RPQnnQR1 A 


n n I r auo iD4c a 


MT9 R P^nn^ny 

IM Icn r* OUU'IU / o 


ririirAU 1 uoui a 


MTORP^nn/mon 

IM I cnrOUUHUyU 


UQICAn07^0QQ 

nnirAUc/ocya 


MT9 r p ^ nn 4? nc> 


WRIFAn9RQ97a 

nriir/AU^oo^/ a 


NT? r P^nnd/^nQ 


MRIFAnQ^77fta 

nriirAU^o/ /oa 


NT? R PQnn4/*4R 


MRIFAnOt^QRQQ 


MTORP*5nnjlKR7 


nnirAU^oyu/ a 


IN 1 4Lr\rH\j\j 1 ho/ 


MQICAni QOTRq 

nnlrAUl o^/oa 


OVARP1 nnn^i ^ 


nriirAU I i u i Od 


HVARP1 nnn4i n 

UvnnO I UUU*f I U 


MRIFAnQ9f\Q1 o 

nnirnUttDy i a 


nvARn nnn^WQ 


HRIFAm 1 1 nRa 
nriirMU 1 1 1 UOd 


o\/arpi nm nfiR 


HRIFAm 1 ^ADa 






o\/ARnnni «^7n 

UVnnU I UU I O/ U 


nriirAUi <ia 


pi Ar w Pmni/in7 

rLnOC I UU 1 HKJ/ 


nnirAU i ^uoya 


PLACE1001464 


HRIFA01 3276a 


PLACE1001516 


HRIFA01 2702a 


PLACE1001795 


HRIFA01 2885a 


PLACE1001918 


HRIFA01 2969a 


PLACE1 002080 


HRIFA0130921a 


PLACE1002153 


HRIFA013135a 


PLACE1 002355 


HRIFA01 3254a 


PLACE1 002374 


HRIFA01 3265a 


PLACE1 002726 


HRIFA01 8688a 


PLACE 1003428 


HRIFA013911a 
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Table 20 (continued) 



The representative sequences of the most homologous sequences in the SwissProt with the keyword(s) "signal", 
and the selected clones (except overlapped with Table 16, 17, 18 and 19) 



name of clone 



PLACE1 003460 
PLACE 1003772 
PLACE1 004078 
PLACE1 004520 
PLACE1 004648 
PLACE1 004887 
PLACE1 005426 
PLACE 1006071 
PLACE1 006443 
PLACE1006716 
PLACE1 006959 
PLACE1 007077 
PLACE1 007081 
PLACE1 007702 
PLACE1 008657 
PLACE1 008744 
PLACE1 009546 
PLACE1011116 
PLACE1011708 
PLACE2000118 
PLACE3000213 
PLACE4000354 
THYR01 000061 
THYR0 1000846 
THYRO1001063 
THYRO1001608 
THYRO1001803 
Y79AA1 000876 
Y79AA1001090 
Y79AA1001272 
Y79AA1001727 
Y79AA1001803 
Y79AA1 002376 



name of representative sequence 



HRIFA01 3932a 
HRIFA014133a 
HRIFA01 4336a 
HRIFA01 4621a 
HRIFA0 14702a 
HRIFA01 4868a 
HRIFA01 5246a 
HRIFA01 6639a 
HRIFA01 5902a 
HRIFA01 6070a 
HRIFA016214a 
HRIFA01 6639a 
HRIFA0 16290a 
HRIFA01 6669a 
HRIFA01 7257a 
HRIFA01 7312a 
HRIFA017801a 
HRIFA01 8754a 
HRIFA019105a 
HRIFA024994a 
HRIFA01 5486a 
HRIFA01 5486a 
HRIFA01 3279a 
HRIFA029349a 
HRIFA029434a 
HRIFA050456a 
HRIFA030566a 
HRIFA030629a 
HRIFA028511a 
HRIFA028576a 
HRIFA006642a 
HRIFA032073a 
HRIFA032820a 



Table 21 



The representative sequences of the most homologous sequences in the SwissProt with the keyword(s) 
"glycoprotein", and the selected clones 


name of clone 


name of representative sequence 


BNGH41 000087 
BNGH41 000091 
HEMBA1 000275 
HEMBA1 000349 
HEMBA1 000590 
HEMBA1 000634 
HEMBA1 000835 
HEMBA1 000907 


HRIFA029508a 
HRIFA029511a 
HRIFA000264a 
HRIFA000327a 
HRIFA000553a 
HRIFA004780a 
HRIFA000776a 
HRIFA000845a 
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Table 21 (continued) 



The representative sequences of the most homologous sequences in the SwissProt with the keyword(s) 


"glycoprotein", and the selected clones 




name of clone 


name of representative sequence 


HEMBA1001221 


HR!FA001132a 


HEMBA1001228 


HRIFA001138a 


HEMBA1001621 


HRIFA00 1489a 


HEMBA1002131 


HRIFA001942a 


HEMBA1002164 


HRIFA001972a 


HEMBA1002167 


HRIFA001975a 


HEMBA1002178 


HRIFA001984a 


HEMBA1002316 


HRIFA002102a 


HEMBA1 002421 


HRIFA005392a 


HEMBA1 002767 


HRIFA002503a 


HEMBA1 003047 


HRIFA002743a 


HEMBA1003101 


HRIFA002787a 


HEMBA1 003230 


HRIFA002891a 


HEMBA1 003392 


HRIFA002980a 


HEMBA1 004250 


HRIFA003504a 


HEMBA1 004391 


HRIFA020693a 


HEMBA1 004444 


HRIFA029285a 


HEMBA1 004454 


HRIFA003592a 


HEMBA1 004505 


HRIFA003635a 

■ ■ W m 9 9 m » ^wr %hPj 


HEMBA1 005449 


HRIFA020707a 


HEMBA1 005489 


HRIFA024543a 


HEMBA1 005522 


HRIFA02 1398a 


HEMBA1 005545 


HRIFA020272a 

t ll III • % W 't—m r m 


HEMBA1 006335 


HRIFAO 12069a 


HEMBA1 006572 


HRIFA021543a 

P 9 9 WWW ¥ P &m P ^mW ¥ P 


HEMBA1 006707 


HRIFA021499a 


HEMBA1 006724 


HRIFA021620a 


HEMBA1 006749 


HRIFA021637a 


HEMBA1 006902 


HRIFA021754a 


HEMBA1 007057 


HRIFA022985a 


HEMBA1 007332 


HRIFA022493a 

P 9 ¥ IIP ¥ PJ tt i. *■ ii W «tap*» 


HEMBB1 000447 


HRIFA001558a 


HEMBB1 000567 


HRIFA029730a 


HEMBB1 000679 


HRIFA029802a 


HEMBB1 000881 


HRIFA029932a 

P IP IIP * \ M V' P_<% 


HEMBB1001048 


HRIFA030045a 


HEMBB1 002427 


HRIFA005760a 

■ ■ 1 1 1 1 r P » p*^ PjA 


MAMMA1000106 


HRlFA024482a 


MAMMA1 000403 


HRlFA026465a 


MAMMA 1000591 


HRIFA026303a 


MAMMA1 000681 


HRIFA026364a 


MAMMA1 000706 


HRIFA026382a 


MAMMA1001043 


HRIFA026764a 


MAMMA1001237 


HRIFA026860a 


MAMMA1001615 


HRIFA022865a 


MAMMA1001893 


HRIFA027485a 


MAMMA1001978 


HRIFA027549a 
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Table 21 (continued) 



The representative sequences of the most homologous sequences in the SwtssProt with the keyword(s) 


"glycoprotein", and the selected clones 




name of clone* 


name or representative sequence 


i MAMMAinn?n7n 

tVlrAIV(IVI/"\ 1 UUcU / U 


HRIFA028371a 


MAMMA mm HQ1 


HRIFA027622a 


MAMMA1 flOPI PR 


HRIFA02 7644a 


MAMMA 1 Oft? ciRR 


II II — A i^/"i~»r\ -j 

HRIFA027012a 


MAMMAinn/*1Pfi 


i j n i ^ a ^v/^ ft ft 

HRIFA029263a 


NT? R M 1 ClClCiARO 


HRIFA004426a 


NT?RMmnn^4? 


1 i i ^ a ft ft a * ft 

HRIFA004490a 


NTPR M 1 nnnRRn 

1 1 1 C-l 1 1 VI 1 uuuoou 


HRIFA004523a 


NT?RM?nnn4?^ 


HRIFA022065a 


NTPRMPOOi RPR 


HRIFA028911 a 


NTPRMPnm 7Q? 


1 1 n 1 A y*\ y\ ft. ft ft. ft 

HRIFA029002a 


NTPRMPnm R1 q 


HRIFA031935a 


MT>>RM9nni qqq 


ft i i f~~ a ft. ft; ^^ft ft. 

HRIFA032009a 


NTPRMPfim Q41 


HRIFA032011a 


NT?RM4finm Qfl 


HRIFA032453a 


NT? R M4nnn?R4 


1 ) r™j I ^ a ft ft ft r~ r*\ ft 

HRIFA032506a 


NT?RM4nnn4i 7 


HRIFA032587a 


in 1 ^niviHUUUoo/ 


HRIFA032696a 


NT?RM4nnnR4R 

IN 1 c. niVI 4 HJVJUD*fO 


HRIFA032730a 


NT?RM4nmfl4'3 

IN 1 tFllVI<4UU 1 OfO 


1 1 r*l 1 f — A ft ft jk *—y J ft 

HRIFA02471 8a 


NT?RM4nn?'3«>? 
in 1 ^nivifuu^oo^ 


f 1 ^> 1 A ft ft ^ ft a — » 

HRIFA001337a 


mtp R p i nnnnno 


1 1 l™t 1 A ft. ft - » 

HRIFA005072a 


In I c.v\\r I uuuuou 


HRIFA005102a 


IN I n r | UUUO I v> 


t i r*s 1 1~~ a ft ft t — ^ «^ 

HRIFA005462a 


WTPRP1 nnnom 

In 1 £.ri r | UUUyo 1 


HRIFA005702a 


NTPRPinmnn/i 

IN I tnr 1 UU 1 UU*f 


| 1 ■ r - a ft ft r~ ft 

HRIFA005720a 


NT?RP?nnn^Q4 

in i tnrtuuuoy*f 


i i 1 1™ a ft ft ft ft a ft 

HRIFA003640a 


njtpr ppnnnR 1 4 

IN 1 cnr^UUUu 1 H 


i I 1 ■"" A ft ft ft a ft A ^ 

HRIFA006494a 


NTPRPPnnnRi r 

IN 1 trin^UUUD 1 D 


HRIFA006572a 


NT?RP?nni4Rn 


1 I r*x ■ | — a ft ft t ^» j a. 

HRIFA007219a 


NTPR P?nn 1 rro 


HRIFAO 10799a 


NTPRPPnm 7RR 
IN 1 tnrtUU t t OO 


1 1 1 1^ A ft ft A A 

HRIFA007424a 


NTPRPPnm ft 7R 


1 j ■ i~" a ft ft t r - j ft _ 

HRIFA007512a 


NTPRPP0HP1 RR 
IN 1 tnrtUUt 1 OO 


1 1 1 A ft ft a r~ 

HRIFA007745a 


ntp R pp n CSOLCiA 


| ifti r- a ft ft -^ft j-x ft 

HRIFA007829a 


NT?RP?nn^4no 


1 1 f-~\ 1 | — A ft ft ™* ft. ft ft 

HRIFA007909a 


NTPRPpnnpRi n 

IN 1 tnrtUUtO 1 U 


HRIFA007985a 


NTPRPPnnpR^^ 


HRIFA008000a 


NT2RP2002564 


HRIFA007244a 


NT2RP2002942 


HRIFA008284a 


NT2RP2003042 


HRIFA008362a 


NT2RP2003469 


HRIFA008661a 


NT2RP2003931 


HRIFA008976a 


NT2RP2004205 


HRIFA009171a 


NT2RP2004447 


HRIFA009339a 


PnT2RP2004606 


HRIFA009451a 


NT2RP2004648 


HRIFA009482a 



80 



EP 1 130 094 A2 



Table 21 (continued) 



The representative sequences of the most homologous sequences in the SwissProt with the keyword(s) 


"glycoprotein", and the selected clones 




name of clone 


name of representative sequence 


NT2RP2004847 


HRIFA005944a 

V 9 9 9 ¥ ¥ 9 1 Vtm* S9* \m* ¥ ¥ K^Wl 


NT2RP2005181 


HRIFA005409a 


NT2RP2005247 


HRIFA009881a 


NT2RP2005541 


HRIFA010090a 


NT2RP2005597 


HRIFA010130a 

■ ■ ■ WWW 9 9 VmW 9 **mT W 9*0 %rA\ 


NT2RP2005632 


HRIFA010152a 

■ W ¥ 111 99 WmW I 9m9 * WkmW 1 Wum 


NT2RP2005883 


HRIFA010319a 

■ ■ ■ ¥ ' 9 9 9 W ^mV WMW | m±jr %jj9 


NT2RP2006004 


HRIFA002766a 


NT2RP2006042 


HRIFA01 0425a 

¥ ¥ 9 9 W ¥ W 9 VmW ¥ W ^mm 


NT2RP2006269 


HRIFA025913a 


NT2RP3000059 

www m m mm m m m' ^m* ™9r ^ar W*r ^ar 


HRIFA022203a 


NT2RP3000063 


HRIFA022528a 

1 It III 9 WbbWkVU 


NT2RP3000125 


HRIFA022234a 

1 ■ ■ II 1 9 \ ^_ ^_ ^_ *mm* ¥ L4 


NT2RP3000304 


HRIFA022616a 

■ Mill 9 % 9*4* ^^^^ f WL4 


NT2RP3000481 


HRIFA028614a 

I II III * mvi— %/\^ 1 >vt 


NT2RP3000616 


HRIFA022875a 

I II IP! * >V ■ ■ W » X^ ^4 


NT2RP3000871 

m mt u mt mm m mm %^ ^m* *mV 9 I 


HRIFA023048a 

1 II 111 W XXm* tmm\mf V f 


NT2RP3000921 


HRlFA023069a 

I II III f % \S t \S \jf \J u 


NT2RP3001012 


HRIFA023129a 

1 1 1 1 1 1 / l \-/ L — V-/ | £ — V WJk. 


NT2RP3001061 

■ v ■ Aav 9 9 ¥ vmT ■ 9m9 ■ 


HRlFA023154a 


NT2RP3001159 


HRIFA023212a 

1 II III / IV^WL. 1 U^l 


NT2RP3001542 

■ V I *— 9 ■ I V*r ¥ *mmT ¥ mmm* 


HRIFA028501a 

i ii ill w \ \y c^- \-/>«-«' v ■ t-i 


NT2RP3001560 

I V I mmm* 9 9 1 wmw ■ %r ^imr \m* 


HRIFA030599a 


NT2RP3001754 


HRIFA007728a 

■ 1 * III 9 \ \m* \f § 9 <_ 


NT2RP3002015 

1*1 mmm, I VI m a^ Cab X* ■ X^ 


HRIFAO 15995a 

1 II III f\\J 1 wvKlwCl 


NT2RP3002160 


HRIFA005760a 

i ii til r\Ww r uvu 


NT2RP3002311 


HRIFA024718a 

1 II III # ^_^T » p \_/ w4 


NT2RP3002448 


HRIFA024218a 

» ■■ III 9 \ m m9' mmm. W ^ mmr 1 


NT2RP3002738 

■ ^ ■ ^mm 9 9 9 ^mw ™mr ^mr mmm w ytm* 


HRIFA020748a 

1 II III r\VL>v r ~Vt4 


NT2RP3002836 


HRIFA024392a 

1 II III f \\J \mm>T^\*J ^ £—\Jk 


NT2RP3002958 


HRIFA01 7670a 

1 II 111 f\ v 1 / \J / V C4 


NT2RP3002983 


HRIFA024473a 

I 1 1 1 1 I 9 9 ^Smmm* W ¥ M V-^ 1^4 


NT2RP3003000 


HRIFA024767a 

1 1 I 1 1 1 # W^— W w / L4 


NT2RP3003076 


HRIFA024978a 

1 I I I 1 1 f IVb 71/ # %J \-A 


NT2RP3003354 


HRIFA008212a 

1 II III t\ \J y^JL— l U 


NT2RP3003532 


HRIFA01 3477a 

1 II III / »v 1 / * CA 


NT2RP3003729 

I W I mmwrn fl ■ 1 9tmr ^tmW \mf ImmT ¥ V 


HRIFA025488a 

i ii hi rivkv^u uu 


NT2RP3004130 


HRIFA025703a 

1 f 1 III F % V<* A— 9 \m* Km* \*A 


NT2RP3004133 


HRIFA025706a 


NT2RP3004309 


HRIFA025778a 


NT2RP3004481 


HRIFA026089a 


NT2RP3004552 


HRIFA025904a 


NT2RP3004625 


HRIFA026564a 


NT2RP3004640 


HR!FA030250a 


NT2RP4000108 


HRIFA001341a 


NT2RP4001467 


HR!FA013276a 


NT2RP4001877 


HRIFA032433a 
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Table 21 (continued) 



The representative sequences of the most 
"glycoprotein", and the selected clones 

name of clone 



NT2RP4002750 
OVARC1 000003 
OVARC1 000090 
OVARC1000313 
OVARC1 000467 
OVARC1 000553 
OVARC1 000873 
OVARC1 000956 
OVARC1001049 
OVARC1001086 
OVARC1001260 
OVARC1 001336 
OVARC1001569 
OVARC1001570 
OVARC1001607 
OVARC1001991 
OVARC 1002058 
PLACE1 000740 
PLACE1001016 
PLACE1001114 
PLACE1001123 
PLACE1001231 
PLACE1001407 
PLACE1001464 
PLACE1001516 
PLACE1 001564 
PLACE1001655 
PLACE1001836 
PLACE1 002095 
PLACE 1002329 
PLACE 1002355 
PLACE1 002374 
PLACE1 002905 
PLACE1 002911 
PLACE1003163 
PLACE1 003428 
PLACE 1003438 
PLACE 1003573 
PLACE1 003737 
PLACE1 003852 
PLACE1 004441 
PLACE1 004450 
PLACE1 004520 
PLACE1 004630 
PLACE1004816 
PLACE1 005003 
PLACE1 005383 



homologous sequences in the SwissProt with the keyword(s) 



name of representative sequence 



HRIFA028157a 

HRIFA010790a 

HRIFA01 0859a 

HRIFA011016a 

HRIFA011128a 

HRIFA011197a 

HRIFA032478a 

HRIFA011484a 

HRIFA022702a 

HRIFA01 1580a 

HRIFA019867a 

HRIFA01 9867a 

HRIFA022728a 

HRIFA019412a 

HRIFA022729a 

HRIFA01 9498a 

HRIFA022737a 

HRIFA012151a 

HRIFA012354a 

HRIFA012427a 

HRIFA012436a 

HRIFA012515a 

HRIFA01 2069a 

HRIFA013276a 

HRIFA01 2702a 

HRIFA012737a 

HRIFA012795a 

HRIFA012914a 

HRIFA013103a 

HRIFA013235a 

HRIFA0 13254a 

HRIFA0 13265a 

HRIFA013586a 

HRIFA0 13589a 

HRIFA013744a 

HRIFA013911a 

HRIFA013919a 

HRIFA01 4006a 

HRIFA014111a 

HRIFA014185a 

HRIFA014561a 

HRIFA01 4568a 

HRIFA014621a 

HRIFA014688a 

HRIFA014819a 

HRIFA014951a 

HRIFA015219a 
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Table 21 (continued) 



The representative sequences of the most homologous sequences in the SwissProt with the keyword(s) 


"glycoprotein", and the selected clones 




name of clone 


name of representative sequence 


PLACE1 005426 


HRIFA01 5246a 


PLACE1 005539 


HRIFA029425a 

1 II 111 i \ W £— \_/^T x/ C* 


PLACE 1005544 


HRIFA009852a 

■ t ■ ivy # iv \v xy w vw v«4 


PLACE 1005569 


HRIFA015351a 

1 II III /»W 1 WV W 1 Xr* 


PLACE1 006071 

t 1— * ■ ■ w w^ x^ # ■ 


HRIFA01 6639a 

1 II III #\^/ 1 UUui/u 


PLACE 1006073 

I 1 r ■ X— <^ L^. 1 X^ X^ X^ X^ » X** 


HRIFA003402a 

1 II III VU*TVL.U 


PLACE1 006277 


HRIFA01 5802a 

i i ■ ill i xJUvlu 


PLACE 1006290 

1 " i ■ x^ V X^fc» x^ x^ 


HRIFA015811a 

i ii hi /\ \j i >ju i i ci 


PLACE1 006443 


HRIFA01 5902a 

f II III fiw 1 wwWx_U 


PLACE1006716 

I X»# V X^ ■ X^ X^ ■ • Xrf 


HRIFA01 6070a 

1 11 III /\ W t vv ' W U 


PLACE 1007077 


HRIFA01 6639a 


PLACE1 007081 


HRIFA0162Q0a 

1 1 1 1 1 1 Hv 1 U^C7 va 


PLACE1 007845 

1 LOW x_ I v \J § \J X^ 


HRIFA0167RRa 

1 II 111 r\U 1 U# JUa 


PLACE1 008469 

1 » X^ I X^ X^ X^ W X^ 


HRIFA017146a 

1 11 III r» w 1/1 ^tUu 


PLACE1008716 


HRIFA01 7295a 

1 II ill nu I f C\J+JCX 


PLACE 1008744 


HRIFA017312a 

1 II 111 i\\J 1 f w 1 uu 


PLACE 1008984 

1 U/iV U> 1 wwx-*wx-*~ 


HRIFA0174^6a 

i ii in nu i / "+*jt»a 


PLACE1 008985 

1 UnVU> 1 Wwx^wUx^S 


HRIFA0174 I S7a 


PLACE1 009527 


HRIFA0177Q1a 


PLACE1010251 

i urtvLi. i w i v-'i_xj i 


HRIFAOIftP^Ra 


PLACE1010784 

■ 1 r ix* x_ ■ XV ■ W # x/ r 


HRIFA031126a 

1 II III r^\J\J 1 1 £.Uu 


PLACE1010968 


HRIFA01 8666a 

i ii in nu i uuuuq 


PLACE1011116 


i 1 1 1 1 rnu ii) / o*+ci 


PLACE1011708 


HRIFA01Q10Sa 

i nil rnu • C7 i uja 


PLACE2000118 


HRIFA0P4QQ4a 

1 llll r nU£.'iC/ v7^Cl 


PLACE3000181 


HRIFA004112a 


PLACE3000213 

1 1— #»X^ L. W V W Wd- 1 W 


HRIFA01 54R6a 

i ii iii ny i ^jtuuQ 


PLACE4000354 

1 | IX/ b»~v W w V ■ 


HRIFA01 5486a 

1 IT ill / i V/ 1 x/tUwQ 


SKNMC1000014 

XJ 1 xl w IV* V/ 1 vw ww 1 *T 


HRIFA0^010fia 


THYRO1000196 

1 1 1 1 1 lV*r | www 1 x7xJ 


HRIFAflPPORfla 


THYR01 000584 

trill iv i vVvx/U 


HRIFAn?Q9f)Qa 


THYR01 000956 

■ ■■■I l\/ 1 WW^WWSVW 


HRIFA029393a 

1 II 111 r\ VL Jwv/OQ 


THYRO1001102 

■ ■ • I w I >— * I W 1 1 W *— 


HRIFA0081 74a 

1 II 111 / \ V/vU 1 / *+Q 


THYRO1001128 

■ ■■■VI X^ 1 W XV 1 1 x-* 


HRIFA029467a 

i ii iii nyLj^u / a 


THYRO1001266 


HRIFAf)^nPfi4a 

n ii I rnuoUtDla 


THYRO1001803 

1 1 1 1 1 1 w 1 w w 1 UvU 


HRIFAO^O^fifia 


Y79AA1000127 

1 f w/x/x 1 www 1 L / 


HRIFA0?61?1a 


Y79AA 1000207 

1 9 W 9\9 \ 1 WWW^Wf 


HRIFA0P7Rfi7a 
i iiii rnuc / ou / d 


Y79AA1 000270 


HRIFA005644a 


Y79AA1 000426 


HRIFA027940a 


Y79AA1 000888 


HRIFA028440a 


Y79AA1 001062 


HRIFA023434a 


Y79AA1001272 


HRIFA028576a 


Y79AA1 001426 


HRIFA028651a 


Y79AA1001523 


HRIFA030642a 


Y79AA1001727 


HRIFA006642a 


Y79AA1001863 


HRIFA032097a 
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Table 22 



The representative sequences of the most homologous sequences in the SwissProt with the keyword(s) "serine/ 
threonine-protein kinase", "tyrosine-protein kinase", or p calmodulin-binding", and the selected clones 



name of clone 



HEMBA1001878 

HEMBA1002195 

HEMBA1 002227 

HEMBA1 002551 

HEMBA1 005084 

HEMBA1 005913 

HEMBA1 005929 

HEMBB1 000668 

MAMMA1 000881 

MAMMA1001150 

MAMMA1002142 

NT2RM2000589 

NT2RM2001902 

NT2PP1001020 

NT2RP1001031 

NT2RP2001469 

NT2RP2001529 

NT2RP2001769 

NT2RP2003179 

NT2RP2003545 

NT2RP2004670 

NT2RP3000011 

NT2RP3000022 

NT2RP3000172 

NT2RP3000201 

NT2RP3000820 

NT2RP3003527 

NT2RP3003849 

NT2RP3003874 

NT2RP4000634 

NT2RP4000962 

OVARC1 000255 

OVARC1 000529 

OVARC1 000916 

OVARC1001338 

PLACE1003135 

PLACE1005519 

PLACE1 005736 

PLACE1 008282 

PLACE1 008297 

PLACE1010081 

PLACE1011364 

PLACE1011824 

THYRO1001205 

THYRO1001457 

THYR0 1001 593 



name of representative sequence 



HRlFA001712a 

HRIFA01 7703a 

HRIFA019136a 

HRIFA002309a 

HRIFA020184a 

HRIFA029866a 

HRIFA020335a 

HRIFA029792a 

HRIFA026659a 

HRIFA026813a 

HRIFA027656a 

HRIFA021794a 

HRIFA031986a 

HRIFA005728a 

HRIFA005732a 

HRIFA028061a 

HRIFA007256a 

HRIFA007435a 

HRIFA008459a 

HRIFA008717a 

HRIFA028468a 

HRIFA022177a 

HRIFA022182a 

HRIFA022265a 

HRIFA022546a 

HRIFA018262a 

HRIFA025492a 

HRIFA025250a 

HRIFA025261a 

HRIFA029866a 

HRIFA027681a 

HRIFA01 0975a 

HRIFA011179a 

HRIFA011449a 

HRIFA01 9869a 

HRIFA01 3726a 

HRIFA01 5070a 

HRIFA01 5453a 

HRIFA01 6654a 

HRIFA017031a 

HRIFA018134a 

HRIFA01 8904a 

HRIFA019175a 

HRIFA030237a 

HRIFA030371a 

HRIFA030448a 
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Table 22 (continued) 





The representative sequences of the most homologous sequences in the SwissProt with the keyword(s) "serine/ 




threonine-protein kinase", "tyrosine-protein kinase", or "calmodulin-binding", and the selected clones 


5 


name of clone 


name of representative sequence 




THYRO1001700 


HRIFA030509a 




THYRO1001770 


HRIFA030545a 




Y79AA1 000777 


HRIFA028402a 


10 


Y79AA1 000967 


HRIFA028468a 


Y79AA1 002381 


HRIFA032275a 



Table 23 



The representative sequences of the most homologous sequences in the SwissProt with the keyword(s) 
"transcription regulation" and "activator", "transcription regulation" and "repressor", or "nuclear protein" and 
"repressor", and the selected clones 


name of clone 


name of representative sequence 


HEMBA1 000462 


HRIFA000432a 




HEMBA1 000671 


HRIFA000631a 




HEMBA1 000875 


HRIFA000814a 




HEMBA1001184 


HRIFA001099a 




HEMBA1 001296 


HRIFA001200a 




HEMBA1001886 


HRIFA001720a 




HEMBA1 002048 


HRIFA001866a 




HEMBA1 002985 


HRIFA002689a 




HEMBA1 003120 


HRIFA002805a 




HEMBA1 003497 


HRIFA003063a 




HEMBA 1004007 


HRIFA021040a 




HEMBA1 004067 


HRIFA003340a 




HEMBA1 004085 


HRIFA003357a 




HEMBA1 004785 


HRIFA020862a 




HE MBA 1004952 


HRIFA01 9532a 




HEMBA1 004971 


HRIFA003883a 




HEMBA1 005230 


HRIFA004006a 




HEMBA1 005246 


HRIFA01 9490a 




HEMBA1 005267 


HRIFA004034a 




HEMBA1 006276 


HRIFA02 1224a 




HEMBA1006517 


HRIFA02 1445a 




HEMBA1 006544 


HRIFA02 1494a 




HEMBA1 006770 


HRIFA021651a 




HEMBA1 006912 


HRIFA022335a 




HE MBA 1007063 


HRIFA022348a 




HEMBA1 007226 


HRIFA022411a 




HEMBB1000106 


HRIFA028262a 




HEMBB1 000309 


HRIFA029602a 




HEMBB1 000407 


HRIFA029649a 




HEMBB1 000542 


HRIFA029715a 




HEMBB1001959 


HRIFA031336a 




HEMBB1 002039 


HRIFA031395a 




HEMBB1 002041 


HRIFA031397a 




HEMBB1 002051 


HRIFA026351a 




HEMBB1 002120 


HRIFA031438a 
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Table 23 (continued) 

The representative sequences of the most homologous sequences in the SwissProt with the keyword(s) 
-transcription regulation- and "activator", "transcription regulation" and "repressor", or "nuclear protein" and 
•repressor", and the selected clones 



name oi cione 


I name of representative sequence 


tit MDDl 00*co02 


\ HRIFA009136a 


ntMDDi 00266 1 


HRIFA031869a 


M AMIVIAI 000528 


HRIFA026265a 


MAMMA! 000614 


HRIFA026316a 


MAMMAI 00081 0 


I HRIFA026615a 


KA AKAKA A 4 r\r\ h nn a 
MAMMA 1 001 094 


HRIFA026789a 


KA A Kit flX A 4 f\r\ ^ rnn 

MAMMA 1 001532 


HRIFA027125a 


MAMMAI 001 609 


HRIFA027173a 


MTOQ&JH AAAvlAT 

ni I ^HMI 000407 


HRIFA004401a 


N I 2RM1 000789 


HRIFA004663a 


NT2RIvl2001558 


HRIFA028804a 


IM 1 2HM2001 738 


HRIFA028867a 


N 1 2HM2001 767 


HRIFA028983a 


NT2RM4000100 


HRIFA032389a 


IM 1 2 RP 1000239 


HRIFA005214a 


NT2RP1 000271 


HRIFA005240a 


NT2RP1 000465 


HRIFA005369a 


IM I2RP1 000468 


HRIFA005372a 


IM 1 2RP1 000679 


HRIFA005500a 


N 1 2RP1 000740 


HRIFA005540a 


IM I2RP2000092 


HRIFA006183a 


NT2RP2000178 


HRIFA006250a 


IM 1 2HP2000240 


HRIFA006298a 


MTOnRAAAA^W A 

IM 1 2HP2000610 


HRIFA006566a 


MTADnAAAATJ A ! 

N I2RP2000712 


HRIFA006649a 


MTOnnAAAA-7AA ' 

NI2RP2000739 


HRIFA006667a 


MTOHRAAAJ AAA I 

fM I2RP2001223 J 


HRIFA007032a 


k ita n n aaa j /-v— « a 1 

NT2RP2001276 


HRIFA007068a 


NT2RP2001388 j 


HRIFA007152a 


NI2RP2001538 


HRIFA007262a 


NI2RP2001662 | 


HRIFA007352a 


NT2RP2001817 


HRIFA007463a 


K IXA RHa aa J r>n\j 

IMT2RP2001921 | 


HRIFA007547a 


IMT2RP2003138 


HRIFA008426a 


NI2RP2003302 


HRIFA008547a 




HRIFA008981a | 


NT2RP2003950 j 


HRIFA008989a 


NT2RP2004069 


HRlFA009071a 


NT2RP2004108 


HRIFA009101a 


NT2RP2005069 


HRIFA009762a 


NT2RP2005391 


HRIFA009983a 


NT2RP2005535 


HRIFA01 0085a 


NT2RP2005666 


HRIFA010176a 


NT2RP2005774 


HRIFA01 5063a 


NT2RP2006092 


HRIFA01 0460a 


NT2RP2006134 


HRIFA01 0490a | 
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Table 23 


(continued) 


5 


The representative sequences of the most homologous sequences in the SwissProt with the keyword(s) 
"transcription regulation" and "activator", "transcription regulation" and "repressor", or "nuclear protein" and 
"repressor", and the selected clones 




name of clone 


name of representative sequence 




NT2RP3000148 

■ VI *B- | f 1 X^ X^ X^ X^ P » X^ 


HRIFA022249a 

■ ■ * ■■■ ■ 1 X^ 4 k fc I X^ LA 




NT2RP3000232 

■ « ■ ■ ■ I X^ X^ V* X** 


HRIFA022564a 

■ >t ■■■ ■ 1 X» fc- V X^ F X^* 


10 


NT2RP3000378 


HRIFA022671a 

■ ■■ ■■■ r 1 X^ 4^_- *w XV ' P U 


NT2RP3000427 

I « I R ■ 1 X^ X^ X^ T V 


HRIFA025033a 

p pi p p p r i xv«wxV VWU 




NT2RP3000652 

WWW *w 1 ■ f X^ X* X' X^XV«W 


HRIFA022895a 

P ■ 1 • P P F X^ V XV 




NT2RP3000677 

c X I f t f X^ x^ x^ »^ X* r 


HRIFA023007a 

p p p p p p r » v *— x-« r xv v » v« 




NT2RP3001271 


HRIFA023634a 

■ ■ 1 I ■ 1 ■ * X^ * X^ X^ X^ T Vj-+ 


15 


NT2RP3001650 


HRIFA026923a 

■ ■ 1 ■ ■ < ■ IW X* Xj» X^ T ^ 




NT2RP3001976 

■ VI 4 ■ 1 ■ ■ X^ X^ X^ ■ X^ * X^ 


HRIFA026496a 

P PI III \S t— X^ ¥ W U Is^*- 




NT2RP3002286 


HRIFA024185a 




NT2RP3002353 


HRIFA024893a 

1 II 111 rlW^~ xv xJxJ C* 


20 


NT2RP3002664 

1 V ■ 4_ 1 |I XV XV X^ W F 


HRIFA024305a 

1 II III * lwA_~w WU 


NT2RP3004025 

P W P b 1 U XVXVXV ¥ X^ X/ 


HRIFA025290a 

1 II III * ■ V*p ^V «W \V XV P— A. 




NT2RP3004119 

1 v 1 l 1 1 xv »V XV r ■ | v-» 


HRIFA025695a 

i ii 1 1 i nv4>ww vx/u 




OVARC 1000995 

X«V » W 11 IX/ ■ W w V 


HRIFA011512a 

1 II III r * XV 1 1 xV 1 Ck- U 




OVARC 1001132 

x^V » rii IV 1 XV V 1 1 


HRIFA022707a 

1 II III r\xV^_£_ r \v f C4 


25 


OVARC1 001596 

x/ V # x 1 1 x^^ 1 W xv 1 xV x/ 


HRIFA01 9437a 

1 II III r % xv 1 <v™W » d 




OVARC1001725 


HRIFA01 9958a 

• ii til rvv i v/ v/wuu 




OVARC1001952 


HRIFA01 9466a 

1 II III xv I v~UUC* 




OVARC1002178 


HRIFA021 007a 


30 


PLACE1 000258 


HRIFA01 1820a 

1 f 1 III #• XV I 1 ut»vu 


PLACE 1000442 


HRIFA01 1947a 




PLACE1 000907 


HRIFA01 2278a 

1 1 1 111 r\Vv P b w / ivd 




PLACE 1003529 

1 PW/ * ' l__ 1 XV X*r X-/ 


HRIFA01 3980a 

1 II III r%\V 1 x*J xv l/VQ 




PLACE1 003598 

1 ^» » l_ ■ xVxVWhvk/x«* 


HRIFA01 4024a 

1 II III rlw 1 \V^_ ^ d 




PLACE1004166 

1 1 — / » PW 1 xV \# 1 W xV 


HRIFA01 4396a 

1 II III 9 \xv 1 *T U<JUU 




PLACE1004168 


HRIFA01 4397a 

1 II III rl xv 1 Tww / C4 




PLACE1 004519 


HRIFA01 4620a 




PL ACE 1005239 

1 l_#\\«vi_ 1 xvxVw«Wxvv 


HRIFA015122a 

1 II 111 l\\J 1 1 f f,Q 




PLACE 1005682 


HRIFA01 5423a 

i ii ill / » w i ^Ti-wa 


40 


PLACE 1006208 

f l_f » %»V L^. I XVxVxVf-r 


HRIFA01 5756a 

1 II III #% w 1 xv / wUU 




PLACE1 006515 


HRIFA01 5947a 

1 II III r*w 1 xv xv » # C4 




PLACE 1007028 

■ 1— * » v/ I — ■ V * xv«w x-/ 


HRIFA01 6255a 

1 II III rlw 1 UCwx/Ct 




PLACE1 007591 


HRIFA01659Qa 

1 II ill r\u Ivvxvx7x7d 


AK. 


PLACE1 008549 


HRIFA017190a 

t II 111 w\\J 1 / 1 C/wQ 




PLACE1 009735 


HRIFA017921a 




PLACE1011407 


HRIFA018931a 




PLACE1011978 


HRIFA019262a 




THYR01 000580 


HRIFA029208a 


50 


THYR01 000964 


HRIFA029398a 




THYRO1001641 


HRIFA030472a 




Y79AA1 000030 


HRIFA025936a 




179AA 1000750 


HRIFA028187a 


55 


Y79AA1001212 


HRIFA005296a 


Y79AA1 002058 


HRIFA032161a 




Y79AA1 002121 


HRIFA032186a 
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Table 23 (continued) 



The representative sequences of the most homologous sequences in the SwissProt with the keyword(s) I 
transcnption regulation" and "activator", -transcription regulation" and "repressor-, or "nuclear protein" and 
repressor", and the selected clones 


name of clone 


name of representative sequence 


Y79AA1002129 
Y79AA1 002334 
Y79AA1 002378 


HRIFA01 1926a 
HRIFA032257a 
HRIFA032274a 



Table 24 



The representative sequences of the most homologous sequences in the SwissProt with the keyword(s) "disease 
mutation", or "syndrome", and the selected clones 


name of clone 


name of representative sequence 


HEMBA1 000732 


HRIFA000683a 


HEMBA1 000835 


HRIFA000776a 


HEMBA1 004391 


HRIFA020693a 


MAMMA1001623 


HRIFA027179C 


NT2RM2000497 


HRIFA021781a 


NT2RM2000632 


HRIFA022166a 


NT2RP1 000579 


HRIFA005438a 


NT2RP2001903 


HRIFA007532a 


NT2RP2005878 


HRIFA008186a 


NT2RP3002411 


HRIFA005781a 


NT2RP3002737 


HRIFA024132a 


NT2RP4002187 


HRIFA030342a 


PLACE1 000033 


HRIFA011659a 


PLACE1 001500 


HRIFA01 2692a 


PLACE1 005815 


HRIFA01 5506a 


PLACE1 008657 


HRIFA01 7257a 


PLACE1 010713 


HRIFA024504a 



Table 25 



The clones selected by the keyword(s) 



name of clone 



BNGH41 000020 
BNGH41 000087 
BNGH41 000091 
HEMBA1000121 
HEMBA1000128 
HEMBA1 000275 
HEMBA1 000349 
H EM BA 1000443 
HEMBA1 000462 
HEMBA1 000477 
HEMBA1 000590 
HEMBA1 000634 
HEMBA1 000671 



of the top hit data in the SwissProt, and having the maximal score in the 
ATGpM 0.5 or higher. 



name of sequence 



F-BNGH41 000020 
F-BNGH41 000087 
F-BNGH41 000091 
F-HEMBA1000121 
F-HEMBA1 000128 
F-HEMBA1 000275 
F-HEMBA1 000349 
F-HEMBA1 000443 
F-HEMBA1 000462 
F-HEMBA1 000477 
F-HEMBA1 000590 
F-HEMBA1 000634 
F-HEMBA1 000671 



maximal ATGpM score 



0.94 

0.57 

0.81 

0.94 

0.81 

0.83 

0.52 

0.56 

0.75 

0.55 

0.75 

0.94 

0.65 
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Table 25 (continued) 



The clones selected by the keyword(s) of the top hit data in the SwissProt, and having the maximal score in the 




ATGprl 0.5 or higher 




name of clone 


name of sequence 


maximal ATGprl score 


HEMBA1 000745 


F-HEMBA1 000745 


0.78 


HEMBA1 000835 


F-HEMBA1 000835 


0.64 


HEMBA1000907 


F-HEMBA1 000907 


0.94 


HEMBA1 000940 


F-HEMBA1 000940 


0.86 


HEMBA1001184 


F-HEMBA1 001184 


0.54 


HEMBA1001221 


F-HEMBA1 001221 


0.89 


HEMBA1001228 


F-HEMBA1 001228 


0.94 


HEMBA1001390 


F-HEMBA1 001390 


0.90 


HEMBA1001621 


F-HEMBA1001621 


0.94 


HEMBA1001878 


F-HEMBA1001878 


0.92 


HEMBA 1002048 


F-HEMBA1 002048 


0.71 


HEMBA1002131 


F-HEMBA1002131 


0.94 


HEMBA1002164 


F-HEMBA1002164 


0.74 


HEMBA1002167 


F-HEMBA1002167 


0.94 


HEMBA1002178 


F-HEMBA1002178 


0.91 


HEMBA1002316 


F-HEMBA1002316 


0.73 


HEMBA1 002421 


F-HEMBA1 002421 


0.90 


HEMBA1 002524 


F-HEMBA1 002524 


0.80 


HEMBA1 002551 


F-HEMBA1 002551 


0.60 


HEMBA 1002767 


F-HEMBA1 002767 


0.94 


HEMBA 1002985 


F-HEMBA1 002985 


0.90 


HEMBA1 002992 


F-HEMBA1 002992 


0.76 


HEMBA1 003047 


F-HEMBA1 003047 


0.70 


HEMBA1 003072 


F-HEMBA1 003072 


0.94 


HEMBA1003101 


F-HEMBA1003101 


0.94 


HEMBA1 003230 


F-HEMBA1 003230 


0.77 


HEMBA1003315 


F-HEMBA1003315 


0.76 


HEMBA1 003392 


F-HEMBA1 003392 


0.94 


HEMBA1 003487 


F-HEMBA1 003487 


0.90 


HEMBA1 003497 


F-HEMBA1 003497 


0.61 


HEMBA1 003530 


F-HEMBA1 003530 


0.94 


HEMBA1 003945 


F-HEMBA1 003945 


0.94 


HEMBA1 004085 


F-HEMBA1 004085 


0.59 


HEMBA1 004250 


F-HEMBA1 004250 


0.83 


HEMBA1 004391 


F-HEMBA1 004391 


0.94 


HEMBA1 004444 


F-HEMBA1 004444 


0.94 


HEMBA 1004454 


F-HEMBA1 004454 


0.50 


HEMBA1 004505 


F-HEMBA1 004505 


0.94 


HEMBA1 004785 


F-HEMBA1 004785 


0.89 


HEMBA1 004797 


F-HEMBA1 004797 


0.66 


HEMBA1 004982 


F-HEMBA1 004982 


0.55 


HEMBA1 005070 


F-HEMBA1 005070 


0.94 


HEMBA1 005084 


F-HEMBA1 005084 


0.89 


HEMBA1005145 


F-HEMBA1 005145 


0.85 


HEMBA1 005337 


F-HEMBA1 005337 


0.62 


HEMBA1 005449 


F-HEMBA1 005449 


0.91 


HEMBA1 005489 


F-HEMBA1 005489 


0.70 
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Table 25 (continued) 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



The clones selected by the keyword(s) of the top hit data in the SwissProt, 

ATGprl 0.5 or higher. 



and having the maximal score in the 



name of clone 


| name of sequence 


HEMBA1 005522 


1 F-HEMBA1 005522 


HEM BA1 005545 


F-HEMBA1 005545 


HEMBA1 005698 


F-HEMBA1 005698 


H EM BA 1005929 


F-HEMBA1 005929 


HEM BA1 005945 


F-HEMBA1 005945 


HEMBA1 006276 


F-HEMBA1 006276 


HEMBA1 006299 


F-HEMBA1 006299 


HEMBA1 006335 


F-HEMBA1 006335 


HEMBA1 006430 


F-HEMBA1 006430 


HEMBA1 006482 


F-HEMBA1 006482 


HEMBA1006517 


F-HEMBA1006517 


H EM BA 1006544 


F-HEMBA1 006544 


HEMBA1 006572 


F-HEMBA1 006572 


HEMBA1 006707 


F-HEMBA1 006707 


HEMBA1 006724 


F-HEMBA1 006724 


HEMBA1 006749 


F-HEMBA1 006749 


H EM BA 1006902 


F-HEMBA1 006902 


HEMBA1006916 


| F-HEMBA1006916 


HEMBA1007013 


F-HEMBA1007013 


HEMBA1 007057 


F-HEMBA1 007057 


HEMBA1007226 


F-HEMBA1 007226 


HEMBA1 007241 


F-HEMBA1 007241 


HEMBB1000106 


F-HEMBB1000106 


HEMBB1 000447 


F-HEMBB1 000447 


HEMBB1 000668 


F-HEMBB1 000668 


HEMBB1 000679 


F-HEMBB1 000679 


HEMBB1 000881 


F-HEMBB1 000881 


HEMBB1001026 


F-HEMBB1 001026 


HEMBB1001048 


F-HEMBB1001048 


HEMBB1001200 


F-HEMBB1 001200 


HEMBB1001573 


F-HEMBB1 001573 


HEMBB1001847 


F-HEMBB1001847 


HEMBB1001959 


F-HEMBB1Q01959 


HEMBB1 002041 


F-HEMBB1 002041 


HEMBB1 002051 


F-HEMBB1 002051 


HEMBB1 002302 


F-HEMBB1 002302 


HEMBB1 002427 


F-HEMBB1 002427 


HEMBB1 002661 


F-HEMBB1 002661 


MAMMA1000106 


F-MAMMA1000106 


MAMMA1 000204 


F-MAMMA1 000204 


MAMMA1 000226 


F-MAMMA1 000226 


MAMMA1 000403 


F-MAMMA1 000403 


MAMMA1 000473 


F-MAMMA1 000473 


MAMMA1 000496 


F-MAMMA1 000496 


MAMMA1 000591 


F-MAMMA1 000591 


MAMMA1 000681 


F-MAMMA1 000681 


MAMMA1 000788 


F-MAMMA1 000788 



maximal ATGprl score 
073 
0.94 
0.68 
0.72 
0.80 
0.50 
0.94 
0.94 
0.93 
0.59 
0.68 
0.94 
0.62 
0.94 
0.80 
0.94 
0.94 
0.80 
0.82 
0.94 
0.50 
0.94 
0.94 
0.73 
0.50 
0.91 
0.77 
0.94 
0.88 
0.81 
0.80 
0.81 
0.94 
0.79 
0.60 
0.89 
0.94 
0.94 
0.78 
0.94 
0.94 
0.59 
0.86 
0.70 
0.77 
0.94 
0.83 
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Table 25 (continued) 



The clones selected by the keyword(s) of the top hit data in the Swiss Prot, and having the maximal score in the 




ATGprl 0.5 or higher. 




name of clone 


name of sequence 


maximal ATGprl score 


MAMMA1 000814 


F-MAMMA1000814 


0.94 


MAMMA1 000881 


F-MAMMA1 000881 


0.51 


MAMMA1001043 


F-MAMMA1001043 


0.81 


MAMMA1001094 


F-MAMMA1001094 


0.80 


MAMMA1001150 


F-MAMMA1001150 


0.91 


MAMMA1001237 


F-MAMMA1001237 


0.52 


MAMMA1001532 


F-MAMMA1001532 


0.52 


MAMMA1001615 


F-MAMMA1001615 


0.94 


MAMMA1001634 


F-MAMMA1001634 


0.94 


MAMMA1001893 


F-MAMMA1001893 


0.90 


MAMMA1001957 


F-MAMMA1001957 


0.94 


MAMMA1 002070 


F-MAMMA1 002070 


0.82 


MAMMA1 002080 


F-MAMMA1 002080 


0.72 


MAMMA1 002091 


F-MAMMA1 002091 


0.70 


MAMMA1 002095 


F-MAMMA1 002095 


0.85 


MAMMA1002128 


F-MAMMA1002128 


0.79 


MAMMA1 002234 


F-MAMMA1 002234 


0.94 


MAMMA1 002586 


F-MAMMA1 002586 


0.67 


MAMMA1 002633 


F-MAMMA1 002633 


0.94 


MAMMA1003126 


F-MAMMA1003126 


0.94 


NT2RM1 000407 


F-NT2RM1 000407 


0.94 


NT2RM1 000462 


F-NT2RM1 000462 


0.94 


NT2RM1 000542 


F-NT2RM1 000542 


0.94 


NT2RM1 000580 


F-NT2RM1 000580 


0.94 


NT2RM1 000789 


F-NT2RM1 000789 


0.82 


NT2RM1 000855 


F-NT2RM1 000855 


0.94 


NT2RM1 000858 


F-NT2RM1 000858 


0.50 


NT2RM1 000899 


F-NT2RM 1000899 


0.78 


NT2RM2000410 


F-NT2RM2000410 


0.82 


NT2RM2000423 


F-NT2RM2000423 


0.66 


NT2RM2000565 


F-NT2RM2000565 


0.56 


NT2RM2000582 


F-NT2RM2000582 


0.78 


NT2RM2000589 


F-NT2RM2000589 


0.86 


NT2RM2000622 


F-NT2RM2000622 


0.94 


NT2RM2000632 


F-NT2RM2000632 


0.73 


NT2RM2000773 


F-NT2RM2000773 


0.64 


NT2RM2001126 


F-NT2RM2001126 


0.87 


NT2RM2001558 


F-NT2RM2001558 


0.94 


NT2RM2001626 


F-NT2RM2001626 


0.68 


NT2RM2001738 


F-NT2RM2001738 


0.94 


NT2RM2001767 


F-NT2RM2001767 


0.82 


NT2RM2001818 


F-NT2RM2001818 


0.70 


NT2RM2001902 


F-NT2RM2001902 


0.69 


NT2RM2001939 


F-NT2RM2001939 


0.55 


NT2RM2001941 


F-NT2RM2001941 


0.94 


NT2RM4000100 


F-NT2RM4000100 


0.74 


NT2RM4000198 


F-NT2RM4000198 


0.85 
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Table 25 (continued) 



The clones selected by the keyword(s) of the top hit data in the SwissProt, and having the maximal score in the 

ATGprt 0.5 or higher. 



name of clone 


i nail it? Ul oc^ucllCc 


maximal ATGprl score 


NT2RM4000284 




0.89 


NT2RM4000295 




0.74 


NT2RM4000417 




0.82 


NT2RM4000444 


s r-i\i i ^nivi^uuu^H^f 


0.94 


NT2RM4000587 


p MTODM/1 onr»CQ*7 


0.94 


NT2RM4000648 


1 P MTOQU/Mnnne/iQ 
r-lM l c^M*MJUUb4o 


0.88 


NT2RM4000965 


1 P KlTOQkAAr\r\r\ftczc 


0.60 


NT2RM4001377 


1 P MTODM/trirn n~r~t 


0.82 


NT2RM4001735 


r-fM 1 ^MiVI4UUl /Sb 


0.94 


NT2RM4002352 

* V I iwl Iff l~V VbVV^ 


1 P MTOQM/tfV>ooeo 


0.83 


NT2RP1 00000? 


r-N 1 linri 000002 


0.76 


NT2RP10000SO 


r-N 1 inr 1 000050 


0.54 


NT2RP1000181 


1 P MTODDH nr\r\4 oh 
r-N 1 cnrlUOOio 1 


0.73 


NT2RP100023Q 


1 P MTO D D H ftftftOOft 

r-N 1 ccnP i UU(Jc\39 


0.94 


NT2RP1 000261 


r-N 1 *£nr iUUU2oi 


0.57 


NT2RP 1000300 




0.94 


NT2RP1 0004fi^ 


r-N 1 c^nr 1 000465 


0.94 


NT2RP1 nnfi<ifift 

IH 1 CI 11 1 UUuHUO 


r-N 1 2HP1 000468 


0.94 


NT2RP1 000*^1 

t M 1 LI If 1 vuvww 1 


r-N I 2HP1 000551 


0.94 


NT2RP100057Q 


r-N I ^nr 1000579 


0.94 


NT2RP1000R13 


r-IN 1 dnri 000613 


0.90 


NT2RP100067Q 

" 1 tl II 1 VSV/ISU / v7 


! r-IM 1 ^nr 1000679 


0.68 


NT2 RP 1 00074.0 

i ■ i ci in i uuy / «+u 


r-N 1 2nr 1 000740 


0.80 


NT?RPinnflQR1 


r-N 1 2Hr 1 000981 


0.94 


NT2RP1001 nn4 i 

i » i ci ir i vU i u v*+ 


h-N 1 2Hr 1 001 004 


0.74 


NT2RP10010P0 

1 * 1 CI IF 1 UU 1 1/C.W 


r-N 1 <fHr 1 001 020 


0.70 


NT2RP1001031 

1* I CI II 1 \J\J 1 vO 1 


P MTODDH aa4 noH 

r-N I c^rirl U01031 


0.59 


IMT2R P20000Q? 

■ * I CI II CUUUV-3C 


r-N I ^Hr2000092 


0.88 


NT?RP?00niQ4 


r-N 1 2HP2000394 


0.94 


NT2RP2000447 

l » 1 CI 11 CvUvHH 1 


1 MTADDOAAAjI i"7 

r-N t ^Hr2000447 


0.50 


NT2RP2000514 

1 " 1 CI 11 C\J\J\J>J It i 


IT MTODDAAAAC4 A 

r-N 1 2Hr^000514 


0.82 


NT2RP2000533 


P MTOQDOOAflCOO 

r-N 1 ^irir^UUUooo 


0.94 


NT2RP2000610 

■ « 1 CI 1 1 CUUUU 1 U I 


C MTODDOAAACH A 

r-N 1 ^Hr2000610 


0.94 1 


NT2RP2000616 

i » ■ c i ii cuuvu lu 1 


f~ MTO DDI Aft AC < ^ 

r-N 1 c'HrVU0061 6 


0.80 


NTPRPPOflflRclQ 

i » i CI 11 CUUUUH9 I 


r-N 1 2HP2000649 


0.53 


NT2RP200flRfi^ 


r-N 1 4dHr2000663 


0.88 


NT2RP2G006Q4 

1 * i ci ii Cuuuuat 1 


C MTODDAAAACAii 

r-N 1 <fnr2000694 


0.74 


NT2RP2000818 


F-NT2RP200081 8 


r\ ca 


NT2RP2000903 


F-NT2RP2000903 


0.59 


NT2RP2001200 


F-NT2RP2001200 


0.88 


NT2RP2001223 I 


F-NT2RP2001223 


0.92 


NT2RP2001276 


F-NT2RP2001276 


0.59 


NT2RP2001480 | 


F-NT2RP2001480 


0.85 


NT2RP2001495 


F-NT2RP2001495 


0.78 


NT2RP2001514 


F-NT2RP2001514 


0.78 


NT2RP2001529 


F-NT2RP2001529 


0.94 


NT2RP2001538 


F-NT2RP2001538 


0.89 
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Table 25 (continued) 



The clones selected by the keyword(s) of the top hit data in the Swiss Prot, and having the maximal score in the 




ATGpM 0.5 or higher. 




name of clone 


name of sequence 

• 


maximal ATGprl score 


NT2RP2001662 


F-NT2RP2001662 


0.94 


NT2RP2001755 


F-NT2RP2001755 


0.94 


NT2RP2001769 


F-NT2RP2001769 


0.66 


NT2RP2001878 


F-NT2RP2001878 


0.68 


NT2RP2001921 


F-NT2RP2001921 


0.61 


NT2RP2001948 


F-NT2RP2001948 


0.89 


NT2RP2001956 


F-NT2RP2001956 


0.74 


NT2RP2002063 


F-NT2RP2002063 


0.94 


NT2RP2002188 


F-NT2RP2002188 


0.78 


NT2RP2002232 


F-NT2RP2002232 


0.90 


NT2RP2002304 


F-NT2RP2002304 


0.94 


NT2RP2002409 


F-NT2RP2002409 


0.94 


NT2RP2002527 


F-NT2RP2002527 


0.58 


NT2RP2002533 


F-NT2RP2002533 


0.87 


NT2RP2002564 


F-NT2RP2002564 


0.94 


NT2RP2002942 


F-NT2RP2002942 


0.66 


NT2RP2002976 


F-NT2RP2002976 


0.94 


NT2RP2003042 


F-NT2RP2003042 


0.93 


NT2RP2003179 


F-NT2RP2003179 


0.94 


NT2RP2003210 


F-NT2RP2003210 


0.61 


NT2RP2003302 


F-NT2RP2003302 


0.79 


NT2RP2003369 


F-NT2RP2003369 


0.93 


NT2RP2003390 


F-NT2RP2003390 


0.79 


NT2RP2003469 


F-NT2RP2003469 


0.90 


NT2RP2003545 


F-NT2RP2003545 


0.55 


NT2RP2003593 

■ V t mam 1 WW Mm— *r 


F-NT2RP2003593 


0.94 


NT2RP2003655 


F-NT2RP2003655 


0.83 


NT2RP2003664 


F-NT2RP2003664 


0.89 


NT2RP2004069 


F-NT2RP2004069 


0.76 


NT2RP2004108 


F-NT2RP2004108 


0.91 


NT2RP2004141 

t w r mmm w WW mmm mm IP 


F-NT2RP2004141 

W P W P Wmm P VP Mmm * P WW 


0.53 


NT2RP2004447 


F-NT2RP2004447 


0.93 


NT2RP2004606 


F-NT2RP2004606 


0.94 


NT2RP2004648 


F-NT2RP2004648 


0.94 


NT2RP2004670 


F-NT2RP2004670 


0.94 


NT2RP2004794 


F-NT2RP2004794 


0.65 


NT2RP2004847 

■ VP mmm 1 P P immr ■ ■ ■ 


F-NT2RP2004847 

P P V 1 *— » P 9 W ^» ^» P P ■ 


0.94 


NT2RP2005069 


F-NT2RP2005069 


0.89 


NT2RP2005163 


F-NT2RP2005163 


0.79 


NT2RP2005181 


F-NT2RP2005181 


0.87 


NT2RP2005247 


F-NT2RP2005247 


0.77 


NT2RP2005425 


F-NT2RP2005425 


0.77 


NT2RP2005535 


F-NT2RP2005535 


0.51 


NT2RP2005597 


F-NT2RP2005597 


0.74 


NT2RP2005632 


F-NT2RP2005632 


0.87 


NT2RP2005666 


F-NT2RP2005666 


0.77 


NT2RP2005774 


F-NT2RP2005774 


0.87 
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Table 25 (continued) 



The clones selected by the keyword(s) of the top hit data in the SwissProt, and having the maximal score in the 

ATGprl 0.5 or higher. 



name of clone 


name of sequence 


NT2RP2005878 


F-NT2RP2005878 


NT2RP2005883 


F-NT2RP2005883 


NT2RP2005941 


F-NT2RP2005941 


NT2RP2005994 


F-NT2RP2005994 


NT2RP2006004 


F-NT2RP2006004 


NT2RP2006042 


F-NT2RP2006042 


NT2RP2006099 


F-NT2RP2006099 


NT2RP2006512 


F-NT2RP2006512 


NT2RP3000011 


F-NT2RP3000011 


NT2RP3000022 


F-NT2RP3000022 


NT2RP3000059 


F-NT2RP3000059 


NT2RP3000063 


F-NT2RP3000063 


NT2RP3000171 


F-NT2RP3000171 


NT2RP3000172 


F-NT2RP3000172 


NT2RP3000201 


F-NT2RP3000201 


NT2RP3000232 


F-NT2RP3000232 


NT2RP3000304 


F-NT2RP3000304 


NT2RP3000378 


F-NT2RP3000378 


NT2RP3000436 


F-NT2RP3000436 


NT2RP3000444 


F-NT2RP3000444 


NT2RP3000616 


F-NT2RP3000616 


NT2RP3000645 


F-NT2RP3000645 


NT2RP3000676 


F-NT2RP3000676 


NT2RP3000721 


F-NT2RP3000721 


NT2RP3000838 


F-NT2RP3000838 


NT2RP3000871 


F-NT2RP3000871 


NT2RP3000907 


F-NT2RP3000907 


NT2RP3000921 


F-NT2RP3000921 


NT2RP3001061 


F-NT2RP3001061 


NT2RP3001159 


F-NT2RP3001159 


NT2RP3001170 


F-NT2RP3001170 


NT2RP3001195 


F-NT2RP30011 95 


NT2RP3001240 


F-NT2RP3001240 


NT2RP3001322 


F-NT2RP3001322 


NT2RP3001388 


F-NT2RP3001388 


NT2RP3001560 


F-N12RP3001560 


NT2RP3001592 


F-NT2RP3001592 


NT2RP3001650 


F-NT2RP3001650 


NT2RP3001738 


F-N12RP3001738 


NT2RP3002015 


F-NT2RP3002015 


NT2RP3002160 


F-NT2RP3002160 


NT2RP3002286 


F-NT2RP3002286 


NT2RP3002311 


F-NT2RP3002311 


NT2RP3002324 [ 


F-NT2RP3002324 


NT2RP3002342 


F-NT2RP3002342 


NT2RP3002353 ( 


F-NT2RP3002353 


NT2RP3002411 


F-NT2RP3002411 



maximal ATGprl score 



0.70 

0.94 

0.81 

0.62 

0.61 

0.69 

0.65 

0.94 

0.93 

0.55 

0.74 

0.78 

0.72 

0.93 

0.50 

0.61 

0.94 

0.56 

0.65 

0.94 

0.77 

0.91 

0.94 

0.82 

0.94 

0.94 

0.59 

0.64 

0.72 

0.94 

0.70 

0.94 

0.83 

0.62 

0.94 

0.94 

0.74 

0.73 

0.94 

0.93 

0.65 

0.82 

0.94 

0.92 

0.94 

0.76 

0.74 
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Table 25 (continued) 



10 



20 



30 



35 



40 



45 



50 



55 



The clones selected by the keyword(s) of the top hit data in the SwissProt, and having the maximal score in the 




ATGpM 0.5 or higher. 




name of clone 


name of sequence 


maximal ATGprl score 

• 


NT2RP3002448 


F-NT2RP3002448 


0.87 


NT2RP3002571 


F-NT2RP3002571 


0.61 


NT2RP3002664 


F-NT2RP3002664 


0.82 


NT2RP3002738 


F-NT2RP3002738 


0.81 


NT2RP3002790 


F-NT2RP3002790 


0.94 


NT2RP3002836 


F-NT2RP3002836 


0.72 


NT2RP3002887 


F-NT2RP3002887 


0.94 


NT2RP3002900 


F-NT2RP3002900 


0.88 


NT2RP3002958 


F-NT2RP3002958 


0.91 


NT2RP3002983 


F-NT2RP3002983 


0.92 


NT2RP3003000 


F-NT2RP3003000 


0.80 


NT2RP3003076 


F-NT2RP3003076 


0.65 


NT2RP3003354 


F-NT2RP3003354 


0.66 


NT2RP3003448 


F-NT2RP3003448 


0.61 


NT2RP3003473 


F-NT2RP3003473 


0.94 


NT2RP3003527 


F-NT2RP3003527 


0.94 


NT2RP3003532 


F-NT2RP3003532 


0.93 


NT2RP3003614 


F-NT2RP3003614 


0.81 


NT2RP3003729 


F-NT2RP3003729 


0.90 


NT2RP3003849 

■ ^ i wa^ w ww 9a^ m^ i 


F-NT2RP3003849 


0.93 


NT2RP3003939 


F-NT2RP3003939 


0.67 


NT2RP3004025 


F-NT2RP3004025 


0.83 


NT2RP3004067 


F-NT2RP3004067 

■ P 99 9 9 W 9 ^0 1 w m 


0.74 


NT2RP3004075 


F-NT2RP3004075 


0.77 


NT2RP3004090 


F-NT2RP3004090 

P P 93 9 P PJ P 1 X^ 


0.94 


NT2RP3004119 


F-NT2RP3004119 


0.57 


NT2RP3004130 


F-NT2RP3004130 


0.66 


NT2RP3004133 


F-NT2RP3004133 


0.94 


NT2RP3004202 


F-NT2RP3004202 


0.53 


NT2RP3004294 


F-NT2RP3004294 


0.71 


NT2RP3004309 


F-NT2RP3004309 


0.92 


NT2RP3004345 


F-NT2RP3004345 

p » ^ p p pp • ^» p mw 


0.73 


NT2RP3004406 


F-NT2RP3004406 


0.62 


NT2RP3004481 


F-NT2RP3004481 


0.76 


NT2RP3004552 


F-NT2RP3004552 

P ■ W 1 41k I WW 9^ 9^ 1 9t^9^^ 


0.78 


NT2RP3004557 


F-NT2RP3004557 

■ 1 f 1 * 1 W m T 9*0 9^f I 


0.78 


NT2RP3004625 


F-NT2RP3004625 


0.87 


NT2RP3004640 


F-NT2RP3004640 


0.82 


NT2RP3004647 


F-NT2RP3004647 


0.94 


NT2RP4000108 


F-NT2RP4000108 


0.94 


NT2RP4000634 


F-NT2RP4000634 


0.69 


NT2RP4000962 


F-NT2RP4000962 


0.67 


NT2RP4001009 


F-NT2RP4001009 


0.56 


NT2RP4001467 


F-NT2RP4001467 


0.94 


NT2RP4001877 


F-NT2RP4001877 


0.94 


NT2RP4001879 


F-NT2RP4001879 


0.82 


NT2RP4002187 


F-NT2RP4002187 


0.7 
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Table 25 (continued) 



The clones selected by the keyword(s) of the top hit data in the SwissProt, and having the maximal score in the 

ATGprl 0.5 or higher. 



name of clone 


name of sequence 


maximal ATGprl score 


NT2RP4002451 


F-NT2RP4002451 


0.53 


NT2RP4002750 


F-NT2RP4002750 


0.67 


OVARC 1000003 


F-OVARC 1000003 


0.62 


OVARC 1000 105 


F-OVARC1000105 


0.62 


OVARC1000137 


F-OVARC1000137 


0.72 


OVARC 1000255 


F-OVARC1 000255 


0.78 


OVARC 1000307 


F-OVARC1 000307 


0.94 


OVARC 10003 13 


F-OVARC 10003 13 


0.94 


OVARC1 000331 


F-OVARC1 000331 


0.57 


OVARC1 000410 


F-OVARC1 000410 


0.79 


OVARC 1000467 


F-OVARC1 000467 


0.83 


OVARC1 000529 


F-OVARC1 000529 


0.94 


OVARC 1000553 


F-OVARC1 000553 


0.94 


OVARC 1000873 


F-OVARC1 000873 


0.88 


OVARC1000916 


F-OVARC1000916 


0.94 


OVARC1 000956 


F-OVARC1 000956 


0.92 


OVARC1001030 


F-OVARC1001030 


0.94 


OVARC 100 1049 


F-OVARC1001049 


0.94 


OVARC1001086 


F-OVARC1001086 


0.73 


OVARC 1001 132 


F-OVARC1001132 I 


0.94 


OVARC1001163 


F-OVARC1001163 


0.75 


OVARC1001222 


F-OVARC1001222 


0.67 


OVARC1001336 


F-OVARC1001336 


0.92 


OVARC1001338 


F-OVARC1001338 


0.89 


OVARC1001570 


F-OVARC1001570 


1 0.94 


OVARC1001607 


F-OVARC1001607 


0.86 


OVARC1001725 


F-OVARC1001725 


0.81 


OVARC1001952 


F-OVARC1001952 


0.66 


OVARC1001991 


F-OVARC1001991 


0.94 


OVARC 1002058 


F-OVARC1 002058 


0.79 


PLACE 1000442 


F-PLACE1 000442 


0.93 


PLACE1 000740 


F-PLACE1 000740 


0.57 


PLACE1001016 


F-PLACE1001016 


0.92 


PLACE1001114 


F-PLACE1001114 


0.94 


PLACE1001123 


F-PLACE1001123 


0.89 


PLACE1001231 


F-PLACE1001231 


0.81 


PLACE1001340 


F-PLACE1001340 


0.77 


PLACE1001401 


F-PLACE1001401 


0.87 


PLACE1001407 


F-PLACE1001407 


0.94 


PLACE1001464 


F-PLACE1001464 


0.91 


PLACE1001500 


F-PLACE1001500 


0.77 


PLACE1001516 


F-PLACE1001516 


0.89 


PLACE1001564 


F-PLACE1001564 


0.52 


PLACE1001655 


F-PLACE1001655 


0.56 


PLACE1001795 


F-PLACE1001795 


0.91 


PLACE1001836 


F-PLACE1001836 


0.81 


PLACE1001918 


F-PLACE1001918 I 


0.76 
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Table 25 (continued) 



The clones selected by the keyword(s) of the top hit data in the SwissProt, and having the maximal score in the 




ATGpM 0.5 or higher. 




name of clone 


name of sequence 


maximal ATGprl score 


PLACE1001949 


F-PLACE1001949 


0 94 


PLACE1 002080 


F-PLACE1 002080 


0.76 


PLACE 1002095 


F-PLACE1 002095 


0.61 


PLACE1002153 


F-PLACE1002153 


0.94 


PLACE1 002329 


F- PLACE 1002329 


0 94 


PLACE1 002355 


F-PLACE 1002355 


0 57 


PLACE1 002374 


F-PLACE1 002374 


0.92 


PLACE1 002547 


F-PLACE 1002547 


0.87 


PLACE 1002726 


F-PLACE 1002726 


0.83 


PLACE 1002905 


F-PLACE1 002905 

W 1 1 f * • X^ X^ X^ 


0 94 


PLACE1002911 


F-PLACE1 002911 


0 73 


PLACE 1003 135 


F-PLACE1003135 


0 69 


PLACE 1003 163 


F-PLACE1003163 


0 61 

w » W 1 


PLACE 1003428 


F-PLACE 1003428 


0 61 


PLACE 1003438 


F-PLACE 1003438 


0 93 


PLACE 1003460 


F-PLACE 1003460 


0 94 


PLACE 1003573 


F-PLACE 1003573 


0 78 

w ■ i w 


PLACE 1003598 


F-PLACE 1003598 


0 57 

w * W f 


PLACE 1003644 


F-PLACE 1003644 


0 93 


PLACE 1003737 


F-PLACE 1003737 


0 88 

W • WW 


PLACE 1003772 


F-PLACE1 003772 


0 80 


PLACE 1003852 


F-PLACE 1003852 


0 82 


PLACE 1004078 


F-PLACE1 004078 

1 1 L__#i ^/ 1 ww~w r W 


0 94 

w ■ w~ 


PLACE 10041 66 


F-PLACE1004166 

1 1 l — / \ V> 1 — 1 WW~ 1 \J\J 


0 92 


PLACE1004168 


F-PLACE1004168 

1 1 /* \^ l — t VW^T 1 WW 


0 58 

W .WW 


PLACE 1004279 


F-PLACE 1004279 


0 64 

V • w^ 


PLACE1 004441 


F-PLACE 1004441 


0 90 

w . w w 


PLACE 1004450 


F-PLACE1 004450 


0 85 


PLACE 1004482 


F-PLACE1 004482 


0 57 

w . w / 


PLACE1 004492 


F-PLACE1 004492 


0 85 


PLACE1004519 


F-PLACE1004519 


0 74 


PLACE1 004520 


F-PLACE1 004520 

1 ■ W%\/l» ■ W~WwW 


0 94 


PLACE 1004630 


F-PLACE1 004630 


0 76 


PLACE 1004648 


F-PLACE1 004648 


0 59 


PLACE1 004816 


F-PLACE1004816 


0 94 

w • w~ 


PLACE 1004887 


F-PLACE1 004887 

1 I I— 1 WW" WW § 


0 94 

W - W" 


PLACE 1005003 


F-PLACE 1005003 

1 1 I— \m* i— 1 wwwwww 


0 80 

w . w w 


PLACE 1005031 


F-PLACE1 005031 

• 1 L«« * L_ t WWW I 


0 88 


PLACE 1005239 


F-PLACE 1005239 


0.94 


PLACE 1005383 


F-PLACE 1005383 


0.52 


PLACE1 005426 


F-PLACE1 005426 


0.52 


PLACE 100551 9 


F-PLACE1005519 


0.69 


PLACE 1005544 


F-PLACE1 005544 


0.59 


PLACE 1005569 


F-PLACE1 005569 


0.68 


PLACE 1005682 


F-PLACE1 005682 


0.64 


PLACE 1005736 


F-PLACE 1005736 


0.67 


PLACE1005815 


F-PLACE1005815 


0.94 
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Table 25 (continued) 



The clones selected by the keyword(s) of the top hit data in the SwissProt, and having the maximal score in the 




ATGpM 0.5 or higher. 




iiaiiic ui oorie 


name ot sequence 


maximal ATGprl score 


rLAUb 1 0058/8 


F-PLACE 1005878 


0.92 


Dl APCinncoo7 
rLMUbl 00592 / 


F-PLACE 1 005927 


0.94 


rLMOtl UUOU/o 


F-FLACE1 006073 


0.94 


rLMU 1 1 UUo20o 


F-PLACE1 006208 


0.75 


rLAUhl 0062/ / 


F-PLACE1 006277 


0.65 


rLAUbl UUb^yO 


F-PLACE 1006290 


0.94 


rLAUbl 006443 


F-PLACE 1006443 


0.89 


r LAUb 1 006/1 6 


F-PLACE1006716 


0.63 


Dl APCmrtcocn 

r lau 1 1 uooyoy 


F-PLACE 1006959 


0.89 


rLAUb 100/028 


F-PLACE1 007028 


0.94 


rLAUbl 00 /081 


F-PLACE1 007081 


0.77 


rLAUb 1 00/096 


F-PLACE 1007096 


0.84 


rLAUbl 00/702 


F-PLACE1 007702 


0.51 


rLAUbl 006282 


F-PLACE1 008282 


0.70 


r LAU b 1 00o2y / 


F-PLACE1 008297 


0.64 


rLAUbl 008469 


F-PLACE 1008469 


0.94 


r LAU b 1 008549 


F-PLACE1 008549 


0.52 


rLAUb 1 008657 


F-PLACE1 008657 


0.94 


rLAUbl 00871 6 


F-PLACE1 008716 


0.79 


rLAUb 1008744 


F-PLACE1 008744 


0.94 


rLAUbl 0Uo9o4 


F-PLACE1 008984 


0.74 


rLAUbl 009279 


F-PLACE1 009279 


0.60 


PLAC b 1 009527 


F-PLACE1 009527 


0.87 


r LAUbl 009546 


F-PLACE1 009546 


0.80 


rLAUbl 009600 


F-PLACE1 009600 


0.76 


rLAUblOIOOll 


F-PLACE1010011 


0.94 


rLAUbl 01 00/8 


F-PLACE1010078 


0.86 


rLAUbl 01 0081 


F-PLACE1010081 


0.92 


rLAUbl 010251 


F-PLACE1010251 


0.62 


rLAUbl 01 0445 


F-PLACE1010445 


0.94 


HLAUbl010713 


F-PLACE1010713 


0.62 


Dl APC< A^ A007 

rLAUbl 010827 


F-PLACE1010827 


0.88 


rLAUbl 01 0968 


F-PLACE1010968 


0.51 


r LAUbl 01 1 045 


F-PLACE1011045 


0.64 


Dl APr^n^Hc 
rLAUbl01l116 


F-PLACE1011116 


0.69 


rLAUblOill 81 


F-PLACE1011181 


0.78 


rLAUbl 011236 


I~~ 1 A A"~~\ W J y^ _j _» a. — . 

F-PLACE1011236 


0.66 


rLAUbl 01 1364 


F-PLACE1011364 


0.94 


PLACE1011407 


F-PLACE1011407 


0 64 


PLACE1011516 


F-PLACE1011516 


0.83 


PLACE1011708 


F-PLACE1011708 


0.94 


PLACE1011824 


F-PLACE1011824 


0.94 


PLACE3000181 


F-PLACE3000181 


0.74 


SKNMC1 000014 


F-SKNMC1 000014 


0.76 


SKNMC1 000082 


F-SKNMC1 000082 


0.74 


THYRO1000196 


F-THYRO1000196 


0.71 


THYR01 000400 


F-THYRQ1 000400 


0.73 



98 



EP1 130 094 A2 



Table 25 (continued) 



10 



20 



30 



35 



40 



45 



50 



55 



The clones selected by the keyword(s) of the top hit data in the SwissProt, and having the maximal score in the 




ATGprl 0.5 or higher. 




name of clone 


name of sequence 


maximal ATGprl score 


THY R0 1000584 


F-THYR01 000584 


0.73 


THYR01 000678 


F-THYR01 000678 


0.86 


THYR01 000776 


F-THYR01 000776 


0.91 


THYR01 000795 


F-THYR01 000795 


0.92 


THYR01 000846 


F-THYR01 000846 


0.78 


THYR01 000866 


F-THYROI 000866 


0.56 


THYR01 000956 


F-THYR01 000956 


0.94 


THYR01 000964 


F-THYR01 000964 


0.80 


THYRO1001063 


F-THYRO1001063 


0.87 


THYRO1001071 


F-THYRO1001071 


0.94 


THYRO1001102 


F-THYRO1001102 


0.90 


THYRO1001113 


F-THYR01 001113 


0.74 


THYRO1001128 


F-THYRO1001128 


0.79 


THYRO1001205 


F-THYRO1001205 


0.94 


THYRO1001242 


F-THYRO1001242 


0.94 


THYRO1001266 


F-THYRO1001266 


0.94 


THYRO1001456 


F-THYRO1001456 


0 69 


THYRO1001457 


F-THYRO1001457 


0.88 


THYRO1001471 


F-THYRO1001471 

P 1 1 F ■ P ■ ■ f T ¥ I 


0.91 


THYRO1001478 

■ 1 V ■ W W ¥ X^ P ¥ P 


F-THYRO1001478 


0 89 

■ %pr%^ 


THYRO1001529 


F-THYRO1001529 


0.55 


THYRO1001593 


F-THYRO1001593 

P P P 9 P 1 ■ P 1 X^ 


0.94 


THYRO1001608 


F-THYRO1001608 


0 94 


THYRO1001641 


F-THYRO1001641 

■ P P ■ P P V P XK P X^ T V 


0.94 


THYRO1001700 


F-THYRO1001700 

■ VI 9 III 1 X^ 1 » X* 


0.76 


THYRO1001702 


F-THYRO1001702 

■ » ■ p ■ p p ■ i * x^ ■ ■ 


0.80 


THYRO1001770 


F-THYRO1001770 


0.73 


THYRO1001803 


F-THYRO1001803 


0.94 


Y79AA1 000030 


F-Y79AA1 000030 

I 9 J \J 111 ■ 1 X^ X^ X* X# X^ 


0.88 


Y79AA1 000270 


F-Y79AA1 000270 


0 63 


Y79AA1 000426 


F-Y79AA1 000426 


0.92 


Y79AA1 000750 


F-Y79AA1 000750 


0.94 


Y79AA1 000777 


F-Y79AA1 000777 


0.94 


Y79AA1 000876 


F-Y79AA1 000876 


0.94 


Y79AA1 000888 

p P x^ r i r i ■ »^p^ x^ 


F-Y79AA1 000888 

I 9 m \J • » * ■ 1 X^ p*^ p^^ 


0 85 


Y79AA1 000967 


F-Y79AA1 000967 


0 92 


Y79AA1001090 


F-Y79AA1001090 


0 74 


Y79AA1001212 


F-Y79AA1001212 


0 93 


Y79AA1001426 


F-Y79AA1 001426 


0.82 


Y79AA1001523 


F-Y79AA1001523 


0.94 


Y79AA1001727 


F-Y79AA1001727 


0.94 


Y79AA1001787 


F-Y79AA1001787 


0.75 


Y79AA1001799 


F-Y79AA1001799 


0.60 


Y79AA1001803 


F-Y79AA1001803 


0.68 


Y79AA1 002058 


F-Y79AA1 002058 


0.89 


Y79AA1002121 


F-Y79AA1002121 


0.66 


Y79AA1002213 


F-Y79AA1002213 


0.86 
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Table 25 (continued) 



The clones selected by the keyword(s) of the top hit data in the SwissProt, and having the maximal score in the 




ATGprl 0.5 or higher. 




name of clone 


name of sequence 


maximal ATGprl score 


Y79AA1 002378 


F-Y79AA1 002378 


0.82 


Y79AA1 002381 


F-Y79AA1 002381 


0.89 


Table 26 


The clones selected by the keyword(s) of the top hit data in the SwissProt, and having the maximal score in the 


ATGprl 0.3 or higher and less than 0.5. 




name of clone 


name of sequence 


maximal ATGprl score 


HEMBA1 000732 


F-HEMBA1 000732 


0.32 


HEMBA1001886 


F-HEMBA1001886 


0.46 


HEMBA1002163 


F-HEMBA1002163 


0.44 


HEMBA1002195 


F-HEMBA1002195 


0.41 


HEMBA1003120 


F-HEMBA1 003120 


0.44 


HEMBA1 004007 


F-HEMBA1 004007 


0.31 


HEMBA1 004067 


F-HEMBA1 004067 


0.49 


HEMBA1 005267 


F-HEMBA1 005267 


0.37 


HEMBA1 006770 


F-HEMBA1 006770 


0.40 


HEMBB1 000407 


F-HEMBB1 000407 


0.41 


HEMBB1 000542 


F-HEMBB1 000542 


0.47 


HEMBB1 002120 


F-HEMBB1002120 


0.33 


MAMMA 1000810 


F-MAMMA1 000810 


0.43 


MAMMA1001609 


F-MAMMA1001609 


0.32 


MAMMA1001978 


F-MAMMA1001978 


0.32 


MAMMA1002142 


F-MAMMA1002142 


0.34 


MAMMA1002165 


F-MAMMA1002165 


0.34 


NT2RM2001 792 


F-NT2RM2001792 


0.37 


NT2RM4001843 


F-NT2RM4001843 


0.44 


NT2RP1 000271 


F-NT2RP 1000271 


0.41 


NT2RP2000739 


F-NT2RP2000739 


0.39 


NT2RP2001388 


F-NT2RP2001388 


0.47 


NT2RP2001562 


F-NT2RP2001562 


0.41 


NT2RP2001903 


F-NT2RP2001903 


0.46 


NT2RP2003138 


F-NT2RP2003138 


0.41 


NT2RP2003931 


F-NT2RP2003931 


0.44 


NT2RP2004205 


F-NT2RP2004205 


0.37 


NT2RP2005378 


F-NT2RP2005378 


0.43 


NT2RP2005541 


F-NT2RP2005541 


0.31 


NT2RP2006092 


F-NT2RP2006092 


0.47 


NT2RP2006269 


F-NT2RP2006269 


0.41 


NT2RP3000148 


F-NT2RP3000148 


0.44 


NT2RP3000427 


F-NT2RP3000427 


0.38 


NT2RP3000820 


F-NT2RP3000820 


0.44 


NT2RP3001754 


F-NT2RP3001754 


0.30 


NT2RP3001976 


F-NT2RP3001976 


0.46 


NT2RP3002409 


F-NT2RP3002409 


0.31 


NT2RP3002737 


F-NT2RP3002737 


0.33 


NT2RP3003874 


F-NT2RP3003874 


0.34 
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Table 26 (continued) 



20 



The clones selected by the keyword(s) of the top hit data in the SwissProt, and having the maximal score in the 


ATGpii 0.3 or higher and less than 0.5. 




namp nf f'lonp 


npimp nf ^pnupnce 


maximal ATGprl score 




f-o\/arpi ono?7^ 


0.44 


pa/arpi oooqqr 


F-PA/ARP1 000QQ5 


0.34 


ox/arpi 001 rrq 


F-PA/ARP100156Q 


0.30 


PA/ARP1 nm ^Qfi 


F-PA/ARP1 001 *^QR 


0.36 


pi APFinnnQ07 


F-Pi APFinnnQ07 


0.34 


pi apfioo?qr7 


F-Pl APF1009QR7 


0 38 


pi apfioo^pq 


F-PI APF100^R9Q 


0.31 


PI APF100R071 


F-PI APF100R071 


0.38 


PI APF1 00RR1 


F-PI APFIOORRm 


0.48 


pi apfioo7ari 


F-PI APF1007RR1 


0.43 


PLACE1 008359 


F-PLACE 1008359 


0 47 


PLACE 1008985 


F-PLACE 1008985 


0.48 


PLACE1 009735 


F-PLACE 1009735 


0.37 


PLACE1010784 


F-PLACE1010784 


0.44 


PLACE1011978 


F-PLACE 101 1978 


0.39 


THYR01 000061 


F-THYR0 1000061 


0.38 


THYR01 000580 


F-THYR0 1000580 


0.48 


Y79AA1000127 


F-Y79AA1000127 


0.32 


Y79AA1001272 


F-Y79AA1001272 


0.47 


Y79AA1002129 


F-Y79AA1002129 


0.36 



30 



Table 27 



The clones selected by the keyword(s) of the top hit data in the SwissProt, and having the maximal score in the 


ATGprl 0 or higher and less than 0.3. 




name of clone 


name of sequence 


maximal ATGprl score 


HEMBA1 000006 


F-HEMBA1 000006 


0.14 


HEMBA1 000875 


F-HEMBA1 000875 


0.12 


HEMBA1001296 


F-HEMBA1001296 


0.08 


HEMBA1001563 


F-HEMBA1001563 


0.17 


HEMBA1 002227 


F-HEMBA1 002227 


0.05 


H EM BA 1004952 


F-HFMBA1 004952 


0.05 


HEMBA1 004971 


F-HEMBA1 004971 


0.24 


HEMBA1 005230 


F-HEMBA1 005230 


0.14 


HEMBA1 005246 


F-HEMBA1 005246 


0.17 


HEMBA1005913 


F-HEMBA1005913 


0.12 


HEMBA1 006912 


F-HEMBA1006912 


0.11 


H EM BA 1007063 


F-HEMBA1 007063 


0.14 


H EM BA 1007291 


F-HEMBA1 007291 


0.14 


HEMBA1 007332 


F-HEMBA1 007332 


0.23 


HEMBB1 000309 


F-HEMBB1 000309 


0.15 


HEMBB1 000567 


F-HEMBB1 000567 


0.15 


HEMBB1 002039 


F-HEMBB1 002039 


0.15 


MAMMA1 000528 


F-MAMMA1 000528 


0.11 


MAMMA1000614 


F-MAMMA1 00061 4 


0.23 


MAMMA1 000706 


F-MAMMA1 000706 


0.26 


MAMMA1001066 


F-MAMMA1 001066 


0.28 
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Table 27 (continued) 





The clones selected by the keyword(s) of the top hit data in the SwissProt, and having the maximal score in the 




ATGpM 0 or higher and less than 0.3. 




5 


name of clone 


name of sequence 


maximal ATGprl score 




MAMMA1001418 


F-MAMMA1001418 


0.12 




MAMMA1001623 


F-MAMMA1001623 


0.23 




NT2RM2000497 


F-NT2RM2000497 


0.17 


10 


NT2RM4Q00326 


F-NT2RM4000326 


0.23 


NT2RM4000593 


F-NT2RM4000593 


0.16 




NT2RM4000761 


F-NT2RM4000761 


0.13 




NT2RP 1000325 


F-NT2RP1 000325 


0.29 




NT2RP2000178 


F-NT2RP2000178 


0.28 


15 


NT2RP2000240 


F-NT2RP2000240 


0.16 




NT2RP2000712 


F-NT2RP2000712 


0.15 




NT2RP2001469 


F-NT2RP2001469 


0.28 




NT2RP2001817 


F-NT2RP2001817 


0.15 


20 


NT2RP2002510 


F-NT2RP2002510 


0.26 


NT2RP2002824 


F-NT2RP2002824 


0.14 




NT2RP2002974 


F-NT2RP2002974 


0.05 




NT2RP2003940 


F-NT2RP2003940 


0.25 




NT2RP2003950 


F-NT2RP2003950 


0.07 


25 


NT2RP2005391 


F-NT2RP2005391 


0.19 




NT2RP2006134 


F-NT2RP2006134 


0.10 




NT2RP3000125 


F-NT2RP3000125 


0.26 




NT2RP3000481 


F-NT2RP3000481 


0.11 


30 


NT2RP3000652 


F-NT2RP3000652 


0.24 


NT2RP3000677 


F-NT2RP3000677 


0.11 




NT2RP3001012 


F-NT2RP3001012 


0.22 




ft j>k .^k _* ^»k i » j 

NT2RP3001271 


F-NT2RP3001271 


0.29 




NT2RP3001542 


F-NT2RP3001542 


0.28 


35 


OVARC1 000090 


F-OVARC1 000090 


0.21 




OVARC 1000439 


F-OVARC1 000439 


0.21 




OVARC1001260 


F-OVARC1001260 


0.10 




OVARC1002178 


F-OVARC1002178 


0.12 


40 


PLACE1 000033 


F-PLACE 1000033 


0.20 


PLACE1 000258 


F-PLACE1 000258 


0.21 




PLACE 1005539 


F-PLACE1005539 


0.27 




PLACE 1005745 


F-PLACE 1005745 


0.29 




PLACE 1007077 


F-PLACE1 007077 


0.21 


45 


PLACE 1007296 


F-PLACE1007296 


0.27 




PLACE 1007591 


F-PLACE 1007591 


0.20 




PLACE 1007845 


F-PLACE 1007845 


0.21 




PLACE2000118 


F-PLACE2000118 


0.23 


50 


PLACE3000213 


F-PLACE3000213 


0.21 


PLACE4000354 


F-PLACE4000354 


0.20 




Y79AA1 000207 


F-Y79AA1 000207 


0.21 




Y79AA1001062 


F-Y79AA1001062 


0.28 




Y79AA1001863 


F-Y79AA1001863 


0.15 


55 


Y79AA1 002334 


F-Y79AA1 002334 


0.23 




Y79AA1 002376 


F-Y79AA1 002376 


0.29 
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EXAMPLE 13 

Selection of cDNA clone NT2RP2036580 

5 [0184] Clone NT2RP2006580 as well as clone HEMBA1000121 was selected from the representative sequences 
belonging to HRIFA000116a cluster of the most homologous sequence in the SwissProt with the keywoods "trans- 
membrane". Although each of the clones, HEMBA1 000121 and NT2RP2006580, was assembled with other clones for 
5' extension, any other clones did not extend the clones toward the 5' direction. Accordingly, it is possible that both 
clones are full-length cDNA clones. The maximal ATGpM score of F-NT2RP2006580 is 0.37, and therefore, the fullness 

10 ratio is low. However, it is still possible for the sequence to cover the full-length. 

[0185] Thus, the total number of selected clones is 830. Based on the top matching data resulted from Swiss-Prot 
homology search, 659 clones were selected. From them, 447 clones were selected by the keywords of "secretion" and 
"membrane". Among the clones selected based on the top matching data, 60 clones exhibited the maximal ATGprl 
score of 0.3 or higher and less than 0.5. 

15 [0186] The sequences of F-NT2RP2006580 and R-NT2RP2006580 are shown in SEQ ID NO: 2545 and SEQ ID 
NO: 2546, respectively. 

EXAMPLE 14 

20 Full-length sequence analysis and homology search 

[01 87] Full-length sequence was determined for each selected cDNA clones. The nucleotide sequence determination 
was performed mainly by the dye-terminator method using custom synthesized DNA primers according to the primer 
walking procedure (custom synthesized DNA primers were used for sequencing; sequencing reaction was performed 

25 with DNA sequencing reagent supplied by PE Biosystems according to the supplier's manual; and the samples were 
analyzed in an automatic sequencer made by the same supplier). Sequence determination of some clones was carried 
out in the same manner but using a Licor DNA sequencer. Overlapping partial nucleotide sequences, which were 
obtained by the above-described method, were assembled together to determine a full-length nucleotide sequence. 
Amino acid sequences were then deduced from the determined full-length nucleotide sequences. However, amino acid 

30 sequence is not shown for a clone of which coding region was hard to be deduced or of which amino acid sequence 
has less than 1 00 amino acid residues. SEQ ID NOs corresponding to the respective clones are indicated in Table 370. 
[0188] GenBank, Swiss-Prot and UniGene were searched for the determined nucleotide sequences by BLAST anal- 
ysis. Matching data of cDNA clone which exhibits higher homology and of which functions are easily predicted based 
on the nucleotide sequences and the deduced amino acid sequences are selected from the BLAST analysis matching 

35 data with P value of 10 -4 or less. The matching data selected are listed herein. However, there are some clones that 
did not match the criteria for judgment and such matching data of BLAST analysis are not shown herein. The results 
of homology search indicated in the last part of this specification are as follows. 

[0189] Homology search result 1 : data obtained by the homology search of Swiss-Prot database for representative 
sequences of the 5'-end cluster 
40 [0190] Homology search result 2: homology of representative sequences of the 5'-end cluster to the data in Swiss- 
Prot database; the P value is 1 0* 10 or less 

[0191] Homology search result 3: homology of representative sequences of the 5'-end cluster to the data in Swiss- 
Prot database; the P value is higher than 1CT 10 and 10" 4 or less 

[0192] Homology search result 4: homology of representative sequences of the 5'-end cluster to the data in Swiss- 
45 Prot database; the P value is higher than 1(H and 1 or less 

[0193] Homology search result 5: data obtained by the homology search of Swiss-Prot database for 5*-end sequences 
of cDNA clone 

[01 94] Homology search result 6: data obtained by the homology search of GenBank database (http:/lwww.ncbi.nlm. 
nih.gov/web/GenBank/) except for EST and STS sequence data for 5*-end sequences of cDNA clone 
50 [0195] Homology search result 7: data obtained by the homology search of GenBank database (http://www.ncbi.nlm. 
nih.gov/web/GenBank/) except for EST and STS sequence data for 3'-end sequences of cDNA clone 
[0196] Homology search result 8: data obtained by the homology search of Human UniGene database (http://www. 
ncbi.nlm.nih.gov/Unigene) for 5'-end sequences of cDNA clone 

[0197] Homology search result 9: data obtained by the homology search of Human UniGene database (http://www. 
55 ncbi.nlm.nih.gov/Unigene) for 3'-end sequences of cDNA clone 

[0198] Homology search result 1 0: result obtained by the homology search for full-length nucleotide sequences and 
deduced amino acid sequences 

[0199] The P value indicates similarity between two sequences as a score by considering the probability that the two 
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sequences are accidentally similar. In general, as the value is lower the similarity is higher. In general, as the value is 
lower, the homology is higher (Altschul, S.F., Gish, W., Miller, W. : Myers. E.W. & Lipman, D.L (1990) "Basic local 
alignment search tool." J. Mol. Biol. 21 5:403-41 0; Gish, W. & States, D.J. (1 993) "Identification of protein coding regions 
by database similarity search." Nature Genet. 3:266-272). 

Example 15. Gene expression analysis with hybridization using high density DNA filter 

[0200] Nylon membrane for DNA spotting was prepared according to the following procedure. E coli was cultured 
in each well of a 96-well plate (in a LB medium at 37. for 16 hours). A sample of each culture was suspended in 10 
1 of stenle water in a well of a 96-well plate. The plate was heated at 100. for 10 minutes. Then, the boiled samples 
were analyzed by PCR. PCR was performed in a 20 .1 solution by using TaKaRa PCR Amplification Kit (Takara) ac- 
cording to the supplier's protocol. Primers used for the amplification of an insert cDNA in a plasmid were a pair of 
sequencing primers, ME761 FW (5' tacggaagtgttacttctgc 37 SEQ ID NO: 3591) and ME1250RV (5* tgtgggaggttttttctcta 
37 SEQ ID NO: 3592), or a pair of primers, M13M4 (5" gttttcccagtcacgac 37 SEQ ID NO: 3593) and M13RV (5' caq- 
gaaacagctatgac 3' / SEQ ID NO: 3594). PCR was performed using a thermal cycler, GeneAmp System 9600 (PE 
Biosystems) at 95. for 5 minutes; at 95. for 1 0 seconds and at 68. for 1 minute for 1 0 cycles; at 98. for 20 seconds and 
at 60. for 3 minutes for 20 cycles; and at 72. for 1 0 minutes. After the PCR, the 20 . 1 reaction solution was loaded onto 
a 1% agarose gel and fractionated by electrophoresis. DNA on the gel was stained with ethidium bromide to confirm 
the amplification of cDNA. When cDNAs were not amplified by PCR, plasmids containing the corresponding insert 
cDNAs were prepared by the alkali-extraction method (J. Sambrook, E.F., Fritsh, & T. Maniatis, "Molecular Cloning A 
laboratory manual/ 2nd edition, Cold Spring Harbor Laboratory Press, 1989). 

[0201] Preparation of DNA array was carried out by the following procedure. A sample of a DNA solution was added 
in each well of a 384-well plate. DNA was spotted onto a nylon membrane (Boehringer) by using a 384-pin tool of 
Biomek 2000 Laboratory Automation System (Beckman-Coulter). Specifically, the 384-well plate containing the DNA 
was placed under the 384-pin tool. The independent 384 needles were simultaneously dipped into the DNA solution 
for DNA deposition. The needles were gently pressed onto a nylon membrane and the DNA deposited at the tips of 
needles was spotted onto the membrane. Denaturation of the spotted DNA and immobilization of the DNA on the nylon 
membrane were carried out according to standard methods (J. Sambrook, E.F.. Fritsh, & T. Maniatis, "Molecular Clon- 
ing, A laboratory manual/ 2nd edition, Cold Spring Harbor Laboratory Press. 1989). 

[0202] A probe for hybridization was radioisotope- labeled first strand cDNA. Synthesis of the first strand cDNA was 
performed by using Thermoscript*™) RT-PCR System (GIBCO). Specifically, the first strand cDNA was synthesized 
by using 1.5 .g of mRNAs from various human tissues (Clontech), 1 .1 of 50.M Oligo(dT)20 and 50 Ci [^P]dATP 
according to an attached protocol. Purification of a probe was carried out by using ProbeQuant(™) G-50 micro column 
(Amersham- Pharmacia Biotech) according to an attached protocol. In the next step. 2 units of E. coli RNase H were 
added to the reaction mixture. The mixture was incubated at room temperature for 10 minutes, and then 100 g of 
human COT-1 DNA (GIBCO) was added thereto. The mixture was incubated at 97. for 1 0 minutes and then was allowed 
to stand on ice to give hybridization probe. 

[0203] Hybridization of the radioisotope-labeled probe to the DNA array was performed according to standard meth- 
ods (J. Sambrook, E.F., Fritsh, & T. Maniatis, Molecular Cloning, A laboratory manual/ 2nd edition, Cold Spring Harbor 
Laboratory Press, 1989). The membrane was washed as follows: the nylon membrane was washed 3 times by incu- 
bating ,t in Washing solution 1 (2xSSC, 1 % SDS) at room temperature (about 26.) for 20 minutes; then the membrane 
was washed 3 times by incubating it in Washing solution 2 (O.lxSSC, 1% SDS) at 65. for 20 minutes. 
[0204] Autoradiography was performed by using an image plate for BAS2000 (Fuji Photo Film Co., Ltd.) Specifically 
the nylon membrane with probe hybridized thereon was wrapped with a piece of Saran Wrap and brought into tight 
contact with the image plate on the light-sensitive surface. The membrane with the image plate was placed in an 
imaging cassette for radioisotope and allowed to stand in dark place for 4 hours. The radioactivity recorded on the 
image plate was analyzed by using BAS2000 (Fuji Photo Film Co., Ltd.). The activity was subjected to electronic 
conversion and recorded as an image file of autoradiogram. The signal intensity of each DNA spot was analyzed by 
using Visage High Density Grid Analysis Systems (Genomic Solutions Inc.). The signal intensity was converted into 
numerical data. The data were taken in duplicate. The reproducibility was assessed by comparing the signal intensities 
of the corresponding spots on the duplicated DNA filters that were hybridized to a single DNA probe (Figure 2) In 95% 
of entire spots, the ratio between the corresponding spots falls within a range of 2 or less, and the correlation coefficient 
is r=1 .97. Thus, the reproducibility is satisfactory. 

[0205] The detection sensitivity in gene expression analysis was estimated by examining increases in the signal 
intensity of probe concentration-dependent spot in hybridization using a probe complementary to the DNA spotted on 
the nylon membrane. DNA used was PLACE 1008092 (the same as DNA deposited in GenBank under an Accession 
No. AF1 07253). The DNA array with DNA of PLACE 1 008092 was prepared according to the above-mentioned method 
The probe used was prepared as follows: mRNA was synthesized in vitro from the clone, PLACE1 008092 By using 
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this mRNA as a template, radioisotope-labeled first strand cDNA was synthesized in the same manner as described 
above, and the cDNA was used as the probe. In order to synthesize mRNA from PLACE1 008092 in vitro, a plasmid 
in which the 5' end of the cDNA PLACE1 008092 was ligated to the T7 promoter of pBluescript SK(-) was constructed. 
Specifically, the PLACE1 008092 insert was cut out from pM E1 8SFL3 carrying the cDNA at a Dralll site thereof by Xhol 
digestion. The resulting PLACE1 008092 fragment was ligated to Xhol-predigested pBluescript SK(-) by using DNA 
ligation kit ver.2 (Takara). The in vitro mRNA synthesis from PLACE1 008092 inserted into pBluescript SK(-) was carried 
out by using Ampliscribef™) T7 high yield transcription kit (Epicentre technologies). Hybridization and the analysis of 
signal intensity of each DNA spot were performed by the same methods as described above. When the probe con- 
centration is 1x10 7 .g/ml or less, there was no increase of signal intensity proportional to the probe concentration. 
Therefore, it was assumed to be difficult to compare the signals with one another in this concentration range. Thus, 
the spots with the intensity of 40 or less were uniformly taken as low level signals (Figure 3). Within a concentration 
of the probe ranging from 1 x1 0 7 .g/ml to 0.1 .g/ml, the signal was found to increase in a probe concentration-dependent 
manner. The detection limit represented as the ratio of the expression level of test mRNA to that of total mRNA in a 
sample was 1:100,000. 

[0206] Tables 28-184 (also containing clones without description in Examples) show the expression of each cDNA 
in human normal tissues (heart, lung, pituitary gland, thymus, brain, kidney, liver and spleen). The expression levels 
are indicated with numerical values of 0-10,000. Genes that were expressed in at least a single tissue are indicated 
below by the corresponding clone names: 

clone: BNGH41 000020, BNGH41 000087, BNGH41 000091, 
HEMBA1 000300, HEMBA1 000443, HEMBA1 000462, HEMBA1 000477, 



HEMBA1 000835, 
HEMBA1001390, 
HEMBA1 002164, 
HEMBA1002316, 
HEMBA1 002992, 
HEMBA 1003294, 
HEMBA1 003945, 
HEMBA1 004444, 
HEMBA1 005084, 
HEMBA1 005449, 
HEMBA1005016, 
HEMBA1 006430, 
HEMBA1 006749, 
HEMBA1007013, 



HEMBA1 000875, 
HEMBA1001563, 
HEMBA1002167, 
HEMBA1 002421, 
HEMBA1 003047, 
HEMBA1003315, 
HEMBA1 004007, 
HEMBA1 004454, 
HEMBA1 005145, 
HEMBA1 005489, 
HEMBA1006171, 
HEMBA1 006482, 
HEMBA1 006770, 



HEMBA1 000940 
HEMBA1 001621 
HEMBA1 002178 
HEMBA1 002524 
H EM BA1 003072 
H EM BA1 003392 
HEMBA1 004067 
HEMBA1 004505 
HEMBA1 005230 
HEMBA1 005545 
HEMBA1 006276 
HEMBA1006517 
HEMBA1 006902 
HEMBA1 007063 



HEMBA1 007057 
HEMBB1000309, HEMBB1000447, HEMBB1000542, 

HEMBB1 000567, HEMBB1 000642, HEMBB1 000905 

HEMBB1001530, HEMBB1001573, HEMBB1001847 

HEMBB1002041, HEMBB1002051 , HEMBB1002162 

HEMBB1 002465, HEMBB1 002661 , HEMBB1 002663, 

MAMMA1000106, MAMMA1000118, MAMMA1 000204 

MAMMA1 000457, MAMMA1 000473, MAMMA1 000528 

MAMMA1 000681, MAM MA1 000706, MAMMA1 000788 

MAMMA1 000986, MAMMA1 000994, MAMMA1001043 

MAMMA1001150, MAMMA1001284, MAMMA1001310 

MAMMA1001609, MAMMA1001 615, MAMMA1001634 

MAMMA1 002070, MAMMA1 002091 , MAMMA1 002095 

MAMMA1 002205, MAMMA1 002224, MAMMA1 002586, 

NT2RM1 000542, NT2RM1 000789, NT2RM1 000855, 

NT2RM200041 0, NT2RM2000423, NT2RM2000497, 

NT2RM2000589, NT2RM2000622, NT2RM2000773, 

NT2RM2001941 , NT2RM4000198, NT2RM4000295, 



HEMBA 1000962, 
HEMBA1 002048, 
HEMBA1002195, 
HEMBA1 002551, 
HEMBA1003101, 
HEMBA1 003399, 
HEMBA1001085, 
HEMBA1004797, 
HEMBA1005246, 
HEMBA1005698, 
HEMBA1 006311, 
HEMBA1006544, 
HEMBA1006912, 
HEMBA1 007291, 

HEMBB1001026, 
HEMBB1001959, 
HEMBB1 002228, 
HEMBB1 002693. 



HEMBA1000121, 
HEMBA1 000634 
HEMBA1001228 
HEMBA1002131 
H EM BA1 002227 
HEMBA1002767 
HEMBA1003120 
HEMBA1 003487 
HEMBA1004110 
HEM BA1 004952 
HEMBA1 005337 
HEMBA1 005929 
HEMBA1 006335 
HEMBA1 006658 
HEMBA1006916 
HEMBA1 007332 

HEMBB1001048 
HEMBB1001978 
HEMBB1 002302 
MAMMA1 000046, 



MAMMA1 000226, MAMMA1 000403 

MAMMA1000591, MAMMA1000614 

MAMMA1000810, MAMMA1000814 

MAMMA1001066, MAMMA1001094 

MAMMA1001344, MAMMA1001418 

MAMMA1001893, MAMMA1001901 



MAMMA1002128, 
MAMMA1003126 : 
NT2RM1 000858, 
NT2RM2000514, 
NT2RM2001126, 
NT2RM4000444, 



NT2RM4000965, NT2RM4000997, NT2RM4001321 , NT2RM4001325, 



NT2RM4001377, 
NT2RP1 000271, 
NT2RP1 000903, 
NT2RP2Q00479, 
NT2RP2001200, 
NT2RP2001529, 



NT2RM4001735 : 
NT2RP1 000300, 
NT2RF1 000981, 
NT2RP2000533, 
NT2RP2001223, 
NT2RP2001538, 



NT2RM4001768, 

NT2RP1 000325, 

NT2RP2000092, 

NT2RP2000610, 

NT2RP2001388, 

NT2RP2001562, 



NT2RM4001843, 

NT2RP1 000465, 

NT2RP2000178, 

NT2RP2000616, 

NT2RP2001469, 

NT2RP2001662, 



MAMMA1002142 
NT2RM 1000407 
NT2RM2000241 
NT2RM2000565 
NT2RM2001626 
NT2RM4000593 

NT2RP1 000002 
NT2RP1 000468 
NT2RP2000240 
NT2RP2000694 
NT2RP2001480 
NT2RP2001878 



HEMBA1 000275 
HEMBA1 000713 
HEMBA1001296 
HEMBA1002163 
HEMBA1 002239 
HEMBA1 002985 
HEMBA1 003230 
HEMBA1 003530 
HEMBA1 004391 
HEMBA1 005070 
HEMBA1 005430 
HEMBA1 005945 
HEMBA1 006357 
HEMBA1 006707 
HEMBA1 006960 
HEMBB1000106 

HEMBB1001407 
HEMBB1 002039 
HEMBB1 002427 
MAMMA1000102 
MAMMA1 000449 
MAMMA1 000652 
MAMMA1 000881 
MAMMA1001141 
MAMMA1001532 
MAMMA1001957 
MAMMA1002165 
NT2RM1 000462 
NT2RM2000306 
NT2RM2000582 
NT2RM2001792 
NT2RM4000761 

NT2RP1000181 
NT2RP1 000740 
NT2RP2000447 
NT2RP2000739 
NT2RP2001514 
NT2RP2001903 
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NT2RP2001921, 
NT2RP2002409, 
NT2RP2002824, 
NT2RP20Q3302, 
NT2RP2003599, 
NT2RP2004141, 
NT2RP2004606, 
NT2RP2005069, 
NT2RP2005425, 
NT2RP2005666, 
NT2RP2006042, 
NT2RP3000059, 
NT2RP3000201, 
NT2RP3000645, 
NT2RP3000818, 
NT2RP3001159, 
NT2RP3001592, 
NT2RP3002015, 
NT2RP3002409, 
NT2RP3002900, 
NT2RP3003469, 
NT2RP3003874, 
NT2RP3004090, 
NT2RP3004309, 
NT2RP3004552, 
NT2RP4000634, 
OVARC1 000090, 
OVARC1 000331, 
OVARC1 000775, 
OVARC1001030, 
OVARC1001569, 
PLACE1 000231, 
PLACE1001016, 
PLACE1001407, 
PLACE1 002080, 
PLACE1 002547, 
PLACE1 003407, 
PLACE1 003644, 
PLACE1004166, 
PLACE1 004441, 
PLACE1 004648, 
PLACE1 005410, 
PLACE1 005736, 
PLACE1 006073, 
PLACE1 007077, 
PLACE1 009067, 
PLACE1 011045, 
PLACE1011708, 
PLACE4000455, 
THYRO1000866 : 
THYR01 001237, 
THYRO1001523 : 
THYRO1001725 ; 
Y79AA1 000270, 
Y79AA1 000967, 
Y79AA1001427, 
Y79AA1001787, 
Y79AA1002213, 



NT2RP2001956, 

NT2RP2002510, 

NT2RP2002942, 

NT2RP2003369, 

NT2RP2003655, 

NT2RP2004179, 

NT2RP2004648, 

NT2RP2005163, 

NT2RP2005463, 

NT2RP2005774, 

NT2RP2006092, 

NT2RP3000063, 

NT2RP3Q00232, 

NT2RP3000652, 

NT2RP3000820, 

NT2RP3001170, 

NT2RP3001685, 

NT2RP3002160, 

NT2RP3002411, 

NT2RP3002958, 

NT2RP3003473, 

NT2RP3003939, 

NT2RP3004119, 

NT2RP3004321, 

NT2RP3004557, 

NT2RP4001877, 

OVARC1000105, 

OVARC 1000410, 



NTT2RP2002015, 
NT2RP2002527, 
NT2RP2002974, 
NT2RP2003383, 
NT2RP2003940, 
NT2RP2004205, 
NT2RP2004794, 
NT2RP2005181, 
NT2RP2005535, 
NT2RP2005878, 
NT2RP2006099, 
NT2RP3000125, 
NT2RP3000304, 
NT2RP3000676, 
NT2RP3000838, 
NT2RP3001195, 
NT2RP3001738, 
NT2RP3002281, 
NT2RP3002721, 
NT2RP3003000, 
NT2RP3003532, 
NT2RP3003963, 
NT2RP3004130, 
NT2RP3004355, 
NT2RP3004625, 
NT2RP4001879, 
OVARC1000137, 
OVARC 1000439, 



OVARC 1 000853, OVARC 1 000873, 

OVARC1 001 049, OVARC 1 001 086^ 

OVARC 1 001 570, OVARC 1 001 596, 

PLACE1 000258, PLACE1 000442, 



PLACE1001100, 
PLACE1001500, 
PLACE 1002095, 
PLACE1 002726, 
PLACE1 003428, 
PLACE 1003772, 
PLACE1004168, 
PLACE 1004482, 
PLACE1004816, 
PLACE 1005426, 
PLACE 1005768, 
PLACE 1006079, 
PLACE 1007096, 
PLACE 1009527, 
PLACE1011116, 
PLACE1011978, 
SKNMC1000014, 
THYROI 000999, 



PLACE1001114, 
PLACE1001516, 
PLACE1002153, 
PLACE1 002905, 
PLACE1 003438, 
PLACE1 003839, 
PLACE1004199, 
PLACE1 004492, 
PLACE1 004887, 
PLACE1 005539, 
PLACE1 005815, 
PLACE 1006277, 
PLACE1 007626, 
PLACE1 009982, 
PLACE1011181, 



NT2RP2002063, 
NT2RP2002533, 
NT2RP2003138, 
NT2RP2003390, 
NT2RP2003950, 
NT2RP2004447, 
NT2RP2004837, 
NT2RP2005247, 
NT2RP2005541, 
NT2RP2005887, 
NT2RP2006269, 
NT2RP3000148, 
NT2RP3000378, 
NT2RP3000677, 
NT2RP3000907, 
NT2RP3001271, 
NT2RP3001754, 
NT2RP3002311, 
NT2RP3002737, 
NT2RP3003076, 
NT2RP3003614, 
NT2RP3004025, 
NT2RP3004133, 
NT2RP3004374, 
NT2RP3004640, 
NT2RP4002187, 
OVARC1 000208, 
OVARC1 000467, 
OVARC1000916, 
OVARC1001163, 
OVARC1001807, 
PLACE1 000560, 
PLACE1001183, 
PLACE1001655, 
PLACE1 002329, 
PLACE1 002911, 
PLACE1 003460, 
PLACE 1003845, 
PLACE1 004279, 
PLACE1 004520, 
PLACE 1005005, 
PLACE 1005544, 
PLACE 1005878, 
PLACE1 006443, 
PLACE1 007702, 
PLACE1010078, 
PLACE1011236, 
PLACE2000219 



PLACE2000118 

THYRO1000061, THYRO1000099, 
THYR01 001 063, THYROI 001 1 1 3^ 
THYRO1001242, THYRO1001456, THYRO1001457 
THYRO1001529, THYRO1001593, THYRO1001608! 
THYROI 001 770, THYROI 001803, Y79AA1 000127. 

Y79AA1 000521, 
Y79AA1001090, 
Y79AA1 001523, 
Y79AA1001799, 



Y79AA1 000426, 
Y79AA1001013, 
Y79AA1001430, 
Y79AA1001793, 



Y79AA1 002373, Y79AA1 002376, Y79AA1 002381 



Y79AA1 000776, 
Y79AA1001272, 
Y79AA1001530, 
Y79AA1001803, 



NT2RP2002188, 
NT2RP2002564, 
NT2RP2003179, 
NT2RP2003469, 
NT2RP2004069, 
NT2RP2004524, 
NT2RP2004847, 
NT2RP2005378, 
NT2RP2005597, 
NT2RP2005941, 
NT2RP3000011, 
NT2RP3000171, 
NT2RP3000436, 
NT2RP3000721, 
NT2RP3000921, 
NT2RP3001388, 
NT2RP3001858, 
NT2RP3002324, 
NT2RP3002738, 
NT2RP3003354, 
NT2RP3003729, 
NT2RP3004067, 
NT2RP3004202, 
NT2RP3004406, 
NT2RP3004647, 
NT2RP4002715, 
OVARC1 000255, 
OVARC1 000529, 
OVARC1 000956, 
OVARC1 001260, 
OVARC1 001833, 
PLACE1 000912, 
PLACE1 001229, 
PLACE1001836, 
PLACE1 002374, 
PLACE1 002967, 
PLACE 1003529, 
PLACE1 003852, 
PLACE1 004282, 
PLACE1 004630, 
PLACE1 005031, 
PLACE1 005569, 
PLACE 1005927, 
PLACE1006716, 
PLACE1 008469, 
PLACE1010251, 
PLACE1011364, 
PLACE3000181, 
THYROI 000584, 
THYR0 1001 128, 
THYRO1001478, 
THYRO1001700, 
Y79AA1 000207, 
Y79AA 1000777, 
Y79AA1001328, 
Y79AA1001592, 
Y79AA1001863, 



NT2RP2002232, 
NT2RP2002721, 
NT2RP2003210, 
NT2RP2003593, 
NT2RP2004108, 
NT2RP2004556, 
NT2RP2005027, 
NT2RP2005391, 
NT2RP2005632, 
NT2RP2006004, 
NT2RP3000022, 
NT2RP3000172, 
NT2RP3000460, 
NT2RP3000789, 
NT2RP3001044, 
NT2RP3001560, 
NT2RP3001976, 
NT2RP3002353, 
NT2RP3002836, 
NT2RP3003448, 
NT2RP3003849, 
NT2RP3004083, 
NT2RP3004294, 
NT2RP3004481, 
NT2RP4000108, 
NT2RP4002750, 
OVARC1 000313, 
OVARC1 000553, 
OVARC1 000995, 
OVARC1001336, 
OVARC1001991, 
PLACE1 000927, 
PLACE1001340, 
PLACE1001918, 
PLACE1002518, 
PLACE1003163, 
FLACE1 003598, 
PLACE1 004078, 
PLACE1 004305, 
PLACE 1004637, 
PLACE1 005383, 
PLACE1 005725, 
PLACE1 006071, 
PLACE1 006809, 
PLACE1 008985, 
PLACE1010445, 
PLACE1011516, 
PLACE4000354, 
THYROI 000795, 
THYRO1001205, 
THYRO1001495, 
THYRO1001702, 
Y79AA1 000226, 
Y79AA1 000888, 
Y79AA1001426, 
Y79AA1001727, 
Y79AA1 002022, 
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[0207] Genes that were expressed in all the tissues tested are indicated below by the corresponding clone names: 
clone: BNGH41000020, HEMBA1000300, HEMBA1001390, HEMBA1002239, HEMBA1002316, HEMBA1004007, 



HEMBA 1004067, 

HEMBB1 000567, 

MAMMA1000814, 

MAMMA1002165, 

NT2RM2000423, 

NT2RP2001538, 

NT2RP2002510, 

NT2RP2005163, 

NT2RP3001271, 

NT2RP3004647, 

OVARC1001049, 

PLACE 1003598, 

THYRO1001128, 



HEMBA1005145, 
HEMBB1001847, 
MAMMA1001066, 
MAMMA1 002205, 
NT2RM4000761, 
NT2RP2001662, 
NT2RP2002533, 
NT2RP2005535, 
NT2RP3001754, 
OVARC 1000090, 
PLACE1 000231, 
PLACE1 004648, 
THYRO1001205, 



HEMBA1 005230, HEMBA1 005929, 
NEMBB1001978, MAMMA1000614, 
MAMMA1001094, MAMMA1001284, 
MAMMA1 002224, 
NT2RM4000997, 



NT2RP2001903, 
NT2RP2004556, 
NT2RP2006269, 
NT2RP3003076, 
OVARC1 000208, 
PLACE1 000258, 
PLACE1 006443, 
THYRO1001242. 



NT2RM 1000462, 
NT2RP1 000271, 
NT2RP2002015, 
NT2RP2004794, 
NT2RP3000171, 
NT2RP3003354, 
OVARC1 000553, 
PLACE1001516, 
PLACE1 008469, 
THYRO1001803. 



HEMBA1 006357, 
MAMMA1 000652, 
MAMMA1001310, 
NT2RM 1000855, 
NT2RP1 000325, 
NT2RP2002188, 
NT2RP2004847, 
NT2RP3000645, 
NT2RP3003614, 
OVARC1 000995, 
PLACE 1002080, 
PLACE1011708, 
Y79AA1 000207. 



HEMBA1 006482, 
MAMMA1 000810, 
MAMMA1001634, 
NT2RM1 000858, 
NT2RP1 000465, 
NT2RP2002409, 
NT2RP2005069, 
NT2RP3000838, 
NT2RP3004640, 
OVARC1001030, 
PLACE1 00291 1 , 
PLACE2000118, 
Y79AA1001013, 



Y79AA1 001272, Y79AA1001328, Y79AA1001793, Y79AA1001863, Y79AA1 002022, Y79AA1 002376. 

[0208] Genes that were expressed at low levels in any of the tissues tested are indicated below by the corresponding 

clone names: clone: HEMBA1 000006, HEMBA1000128, HEMBA1000349, HEMBA1000590, HEMBA1000671 



HEMBA1 000732, 

HEMBA1001297, 

HEMBA1 003732, 

HEMBA1 005522, 

HEMBB1 000276, 

HEMBB1 001547, 

MAMMA1001623, 

NT2RM1 000580, 

NT2RM2001939, 

NT2RM4000587, 

NT2RP1 000448, 

NT2RP1001020, 

NT2RP2000663, 

NT2RP2001755, 

NT2RP2002674, 



HEMBA1 000745, 

HEMBA1001878, 

HEMBA1 004250, 

HEMBA1 005913, 

HEMBB1 000407, 

HEMBB1 002120, 

MAMMA1001978, 

NT2RM1 000899, 

NT2RM4000100, 

NT2RM4000648, 

NT2RP1 000551, 

NT2RP1001031, 

NT2RP2000712, 

NT2RP2001769, 

NT2RP2002976, 



HEMBA1 000907, 
HEMBA1001886, 
HEMBA1 004785, 
HEMBA1 006299, 
HEMBB1 000668, 
HEMBB1 002245, 
MAM MA 1002080, 
NT2RM2000632, 
NT2RM4000115, 
NT2RM4002352, 
NT2RP1 000579, 
NT2RP1001563, 
NT2RP2000818, 
NT2RP2001817, 
NT2RP2003042, 



HEMBA1001184, 
HEMBA1002420 
HEMBA1004971 
HEMBA1 006572 
HEMBB1 000679 
MAMMA1 000141, 
MAMMA1 002087 
NT2RM2001643 
NT2RM4000284 
NT2RP1 000050 
NT2RP1000613 
NT2RP2000394 
NT2RP2000903 
NT2RP2001915 
NT2RP2003545 



NT2RP2004495, NT2RP2004670, NT2RP2005514, NT2RP2005883, NT2RP2005994, 



NT2RP2006134, 

NT2RP3000871, 

NT2RP3002286, 

NT2RP3002887, 

NT2RP3004075, 

NT2RP4002451, 

OVARC1001132, 

OVARC1 002058, 

PLACE1001123, 

PLACE1001788, 

PLACE1 003737, 

PLACE1 005250, 

PLACE1 005745, 

PLACE1 006786, 

PLACE1 007591, 

PLACE1 008359, 

PLACE1009196, 

PLACE1 009279, 

PLACE1010713, 

PLACE3000213, 

THYR01 000580, 

THYRO1001102, 



NT2RP2006512, 
NT2RP3001012, 
NT2RP3002342, 
NT2RP3002983, 
NT2RP3004345, 
OVARC1 000003, 
OVARC1001222, 
OVARC1002178, 
PLACE1 001231, 
PLACE1001795, 
PLACE 1004028, 
PLACE1005519, 
PLACE1 006093, 
PLACE 1006959, 
PLACE 1007845, 
PLACE 1008549, 

PLACE1 009546, 
PLACE1010784, 
SKNMC1 000004. 



NT2RP3000169, 
NT2RP3001061, 
NT2RP3002448, 
NT2RP3003527, 
NT2RP4000962, 
OVA RC1 000275, 
OVARC1001338, 
PLACE1 000033, 
PLACE1001401, 
PLACE1001949, 
PLACE1 004450, 
PLACE1 005601, 
PLACE1 006208, 
PLACE1 007028, 
PLACE1 007881, 
PLACE1 008657, 

PLACE1 009600, 
PLACE1010827, 
SKNMC1 000082, 



THYR01 000678, THYR01 000776, 
THYR01 001 266, THYR01 001 327. 



NT2RP3000444 
NT2RP3001240 
NT2RP3002571 
NT2RP3003535 
NT2RP4001001 
OVARC1 000298 
OVARC1001607 
PLACE 1000740 
PLACE1001464 
PLACE 1002355 
PLACE1004519 
PLACE 1005660 
PLACE1006219 
PLACE1 007040 
PLACE 1007971 
PLACE1008716 

PLACE1 009735 
PLACE1010968 
THYR01 000036 
THYR01 000846, 
THYRO1001471 



HEMBA1001221 
H EM BA1 003497 
H EM BA1 004982 
HEMBA1006724 
HEMBB1000881 
MAMMA1 000496 
MAMMA1 002234 
NT2PM2001818 
NT2RM4000326 
NT2RP1 000239 
NT2RP 1000679 
NT2RP2000514 
NT2RP2001276 
NT2RP2001948 
NT2RP2003664 



NT2RP3000481 
NT2RP3001322 
NT2RP3002664 
NT2RP3003559 
NT2RP4001009 
OVARC1 000307 
OVARC1001725 
PLACE1000914 
PLACE1001536 
PLACE1003135 
PLACE 1005003 
PLACE1 005669 
PLACE1 006290 
PLACE1 007081 
PLACE1 008282 
PLACE1 008744 

PLACE1010011 
PLACE1011407 
THYRO1000196 
THYR0 1000956 
Y79AA1 000876 



HEMBA1001272 
HEMBA1 003602 
HEMBA1 005267 
HEMBA1 007241 
HEMBB1001200 
MAMMA1001237 
MAMMA1 002633 
NT2RM2001902 
NT2RM4000417 
NT2RP1 000261 
NT2RP1001004 
NT2RP2000649 
NT2RP2001495 
NT2RP2002304 
NT2RP2003931 

NT2RP3000616 
NT2RP3001542 
NT2RP3002790 
NT2RP3004000 
NT2RP4001467 
OVARC1 000811 
OVARC1001727 
PLACE1 000986 
PLACE1001564 
PLACE1 003573 
PLACE1 005239 
PLACE1 005682 
PLACE1006515 
PLACE1 007296 
PLACE1 008297 
PLACE1 008984 

PLACE1010081 
PLACE1011824 
THYR01 000400 
THYRO1001071 
Y79AA1 000959 



Y79AA1001056, Y79AA1001062, Y79AA1 001264, Y79AA1001 795. 

[0209] Genes exhibiting characteristic features in the expression thereof were selected by statistical analysis of these 
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HI ?h W " ,0 d6SCribe ,he SeleCti ° n °' 9enes of which expression is varied greatly among 
ZZa^LlT 'T 6 ' S ^ ,reqUemy 35 3 COn,r0 ' in 9ene eX P ression ana ^ is - Sen- of which expression is 
™ JT ^ I 9 U6S 35 C ° mpared ,hat °' ,hS - actin 9ene were determined as fo.lows. Specific^, sum of 
Zees 22? W , aS H C f ,CU,a,ed in ,he signal in,ensit y * acm observed in each ttesue, which was divSed by 7 
degrees of freedom to determme a variance S,*. Next, sum of squared deviation was calculated in the signal intensity 

e* ract^re SSfS" ° k 7k 3 SI9nlf,Cance level of 5% °' ™« were extracted in the F distribution. Genes 
SKEoS* corresponding clone names: clone: BNGH41000020, NT2RM4000761, 

Genes of which express.cn ,s varied greatly among tissues as compared that of the OVARC1000037 gene were de 
termmed as foHows. Specifically, sum of squared deviation was calculated in the signal intensity of actin observed in 
each tissue wh C h was divided by 7 degrees of freedom to determine a variance S a * Next, sum of sqTred deSon 

of freedom to determ.ne a variance Sf. By taking variance ratio F as F=S b 2/S a 2. genes with a significance Wof 5% 

m ™!f r3Cted iP ,hG F diStribution ' Genes extrac,ed ™ indicated below by the correal ctone na^es 
MH41000020, HEMBA1000300, OVARC1001030, NT2RM4000761, PLACE1000 2 Th1K^^^^^^ 
NT2RP1000325, NT2RP1 000271 , PLACE1004648, HEMBA1005145. HEM^1005929 ^IT2RP200251o' 
NT2RP2001538, NT2RP2002409, NT2RP20021 88. NT2RP2001903, r^2RP2oS 5 33 nSrpSSo 5 
NT2RP2006269, NT2RP2004837, NT2RP2004205, NT2RP2005378. HEMBA1006357 ' HE M S ' 
NT2RP2003940, NT2RP2004794, HEMBA1006912, NT2RP2004556, NT2RP3000838 

=: =: JS~ ES-st = =„' ™€ 
==%=^=° 0,e34 ' — MAM ~' ™ = £ 

Analysis of disease-associated genes 

[0212] Non-enzymic protein glycation reaction is believed to be a cause of a variety of chronic diabetic complications 
ceTa^^ 

2, JT f T complications caused by glycated proteins. Vascular endothelial cells are affected 

ZSorSZTZT I" b, t °°H d - Re3Cti ° n Pr ° dUC,S ° f n0n " enZymiC P r ° tein ^ Cali ™ includ * amadoricompou □ 
(gtycated prote.n) as a m,ldly grated protein and advanced glycation endproduct as a heavily gVcated protein Hence 

a survey was earned out for genes of which expression levels are varied depending on the presence oHhesTa^cated 
proteins ,n endothelial cells. The mRNAs were extracted from endothelial cells that we* ^cuTuredtnlhe £££2 
absence of glycated protein. The m RIM As were converted into radiolabeled first strand cDNAsTor prepaEproTes 
The probes were hybndized to the above-mentioned DMA array. Signal of each DNA spot was deS ^TbaS.00 
and analyzed by ArrayGauge (Fuji Photo Film Co., Ltd ) °e ie «ea D y ba^uoo 

SbSA 31 'sii^f™ ^ 9 ^ i t ° n and P ro u duct of bovine ^rum albumin was prepared as fo.lows: bovine serum albumin 
(BSA S.gma) was mcubated ,n a phosphate buffer solution containing 50 mM glucose at 37 for 8 weeks andThe 
resultmg browmsh BSA was dialed against a phosphate buffer solution 

K2L uT^T? P ulmo "^ arterial endothelial cells (Cell Applications) were cultured in an Endothelial Cell 

Growth Mrtum (CeN Apphcations). The culture dish (Farcon) with the ce.ls were incubated in a CQ 2 

5 A CO , n a humid atmosphere). When the cells were grown to be confluent in the dish, 250.g/ml of bovine serum 

T^XZleT^T 0V 7 Semm ^r'" (Si9m3) " 3dV3nCed atyCati ° n end P rodu « ° f seZalbuZ 
r (T ^ 6 Ce " S W6re lncubated for 33 hours. The mRNA was extracted from the cells bv usino a 

[0215] Table 185 shows the expression level of each cDNA in human pulmonary arterial endothelial cells cultured 

iZST C T mmg b ° Vine 3,bUmin (Si9ma) ' 9,yca,ed bovine serum (Sigma) or advanced gSS 

endproduct of bovme serum albumin. Genes of which expression was detected in the endothelial cell are as 

BNGH41000020, BNGH41000087, HEMBA1000275. HEMBA1000300 HEMBa"o00477 HEMBAIOOOR^ 

HEMBA1000671, HEMBA1000713, HEMBA1 000745. HEMBA1000835 HEMBA1 000875 HEMB^ So940 

HEMBA1001390, HEMBA1002131, HEMBA1002163. HEMBA1002164 HEMBA ' HEMbJ 0^7 

SpSUSS" HEMBA1002420 - HEMBA1002767. HEMBA1002992,' HEM BA1 003047' HEMb£ S ' 

HEMBA1003294, HEMBA1003315, HEMBA1003602, HEMBA1003945, HEMBA1004007,' HEMb£ SSoS' 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



HEMBA1 004971, 
HEMBA1 005945, 
HEMBA1 006482, 
HEMBA1 007063, 
HEMBB1001573, 
HEMBB1 002693, 
MAMMA1000614 
MAMMA1001066 
MAMMA1001893 
MAMMA1 002205 
NT2RM1 000789 
NT2RM2000241 
NT2RM2000622 
NT2RM4000587 
NT2RM4001377 
NT2RP1 000271 
NT2RP2000092 
NT2RP2000649 
NT2RP2001480 
NT2RP2001903 
NT2RP2002409 
NT2RP2002824 
NT2RP2003593 
NT2RP2004108 
NT2RP2004606 
NT2RP2005069 
NT2RP2005541 
NT2RP2005878 
NT2RP3000171 
NT2RP3000652 
NT2RP3000907 
NT2RP3001592 
NT2RP3002353 
NT2RP3002836 
NT2RP3003532 
NT2RP3004075 
NT2RP3004294 
NT2RP3004640 
NT2RP4001467 
OVARC1 000003 
OVARC1000313 
OVARC1 000853 



HEMBA1005145, 
HEMBA1006171, 
HEMBA1 006658, 
HEMBB1 000447, 
HEMBB1001847, 
MAMMA1000102, 
MAMMA1 000652, 
MAMMA1001237, 
MAMMA1001901, 



HEMBA1 005267, 
HEMBA1 006299, 
HEMBA1 006724, 
H EM BB1 000642, 
HEMBB1001978, 
MAMMA1000106, 
MAM MA1 000810, 
MAMMA1001284, 
MAMMA1001957, 
MAMMA1 002633, 



, MAMMA1 002224, 
NT2RM1 000855, NT2RM1 000858, 
NT2RM2000306, NT2RM200041 0. 



NT2RM2000773 
NT2RM4000593 
NT2RM4001735 
NT2RP 1000300 
NT2RP2000240 
NT2RP2000663 
NT2RP2001495 
NT2RP2001948 
NT2RP2002510 
NT2RP2002942 
NT2RP2003599 
IMT2RP2004179 
NT2RP2004648 
NT2RP2005163 



NT2RP2005632, NT2RP2005774, 



NT2RP2006099 

NT2RP3000201 

NT2RP3000676 

NT2RP3001159 

NT2RP3001738 

NT2RP3002409 

NT2RP3002900 

NT2RP3003614 

NT2RP3004083 

NT2RP3004309 

NT2RP3004647 

NT2RP4001877 

OVARC 1000090 

OVARC1 000331 

OVARC 1000873 



OVARC1001049, OVARC1001086, OVARC1001132, 
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OVARC1001222 
OVARC 100 1607 
PLACE1 000740 
PLACE1001229 
PLACE1001918 
PLACE 1002547 
PLACE1 003407 
PLACE1 003839 
PLACE1 004279 
PLACE1 004887 
PLACE1005519 
PLACE1 005745 
PLACE1 006290 
PLACE1 007626 
PLACE1009196 



OVARC1001260 
OVARC1001807 
PLACE1 000927 
PLACE1 001340 
PLACE 1002080 
PLACE1 002726 
PLACE1 003460 
PLACE1 003845 
PLACE 1004282 
PLACE1 005003 
PLACE1 005544 
PLACE1 005768 
PLACE 1006443 
PLACE1 007702 
PLACE 1009600 



NT2RM4000100, 

NT2RM4000761 , 

NT2RM4001768, 

NT2RP1 000325, 

NT2RP2000479, 

NT2RP2000712, 

NT2RP2001529, 

NT2RP2001956, 

NT2RP2002527, 

NT2RP2002976, 

NT2RP2003664, 

NT2RP2004205, 

NT2RP2004794, 

NT2RP2005247, 



NT2RP2006134, 
NT2RP3000232, 
NT2RP3000721, 
NT2RP3001195, 
NT2RP3001754, 
NT2RP3002448, 
NT2RP3003076, 
NT2RP3003939, 
NT2RP3004090, 
NT2RP3004345, 
NT2RP4000108, 
NT2RP4001879, 
OVARC1000105, 
OVARC 1 000410, 
OVARC1000916, 



OVARC1 001336, 
OVARC1001991 
PLACE1001016, 
PLACE1001407, 
PLACE1 002095, 
PLACE1 002905, 
PLACE1 003573, 
PLACE1 004078, 
PLACE1 004441, 
PLACE1 005005, 
PLACE1 005660, 
PLACE1005815, 
PLACE1 006809, 
PLACE1 007845, 
PLACE1 003735, 



HEMBA1 005337, 
HEMBA1 006335, 
HEMBA1 006770, 
HEMBB1000905, 
HEMBB1 002041, 
MAMMA1 000204, 
MAMMA1000814, 
MAMMA1001344, 
MAMMA1 002087, 
MAMMA1003126, 



NT2RM2000423 

NT2RM4000198 

NT2RM4000997 

NT2RM4001843 

NT2RP1 000465 

NT2RP2000533 

NT2RP2000903 

NT2RP2001538 

NT2RP2002015 

NT2RP2002533 

NT2RP200313B 

NT2RP2003931 

NT2RP2004495 

NT2RP2004837 

NT2RP2005378 

NT2RP2006269 

NT2RP3000436 

NT2RP3000818 

NT2RP3001240 

NT2RP3002015 

NT2RP3002721 

NT2RP3003354 

NT2RP3003963 

NT2RP3004119 

NT2RP3004374 

NT2RP4000634 

NT2RP4002187 

OVARC1000137 

OVARC1 000439 

OVARC 1000956 

OVARC1001569 
PLACE1 000231 
PLACE1001100 
PLACE1 001464 
PLACE 1002329 
PLACE1 002911 
PLACE 1003598 
PLACE1004166 
PLACE1 004482 
PLACE1 005031 
PLACE1 005669 
PLACE1 006073 
PLACE 1006959 
PLACE 1008282 
PLACE 101 0081 



HEMBA1 005698, 
HEMBA1 006357, 
HEMBA1006912, 
HEMBB1001026, 
HEMBB1 002427, 
MAMMA1 000403, 
MAMMA1 000881, 
MAMMA1001615, 
MAMMA1 002095. 
NT2RM1000462, 



NT2RM2000582 
NT2RM4000284 
NT2RM4001321 
NT2RP1 000002 
NT2RP1 000740 
NT2RP2000610 
NT2RP2001276 
NT2RP2001662 
NT2RP2002188 
NT2RP2002564 
NT2RP2003210 
NT2RP2003940 
NT2RP2004524 
NT2RP2004847 
NT2RP2005425 

NT2RP2006512 

NT2RP3000460 

NT2RP3000820 

NT2RP3001271 

NT2RP3002324 

NT2RP3002737 

NT2RP3003448 

NT2RP3004025 

NT2RP3004130 

NT2RP3004557 

NT2RP4001001 

NT2RP4002451 

OVARC1 000208 

OVARC1 000553 

OVARC1 000995 

OVARC1 001570 
PLACE 1000258 
PLACE1001114 
PLACE1001788 
PLACE1 002374 
PLACE1 002967 
PLACE 1003644 
PLACE1004168 
PLACE1 004492 
PLACE 1005250 
PLACE 1005725 
PLACE 1006208 
PLACE1007028 
PLACE1 008469 
PLACE1010251 



HEMBA1 005929 
HEMBA1 006430 
HEMBA1 006960 
HEMBB1001048 
HEMBB1 002663 
MAMMA1 000449 
MAMMA1 000986 
MAMMA1001634 
MAMMA1002165 
NT2RM1 000580 

NT2RM2000589 
NT2RM4000444 
NT2RM4001325 
NT2RP1000181 
NT2RP1 000981 
NT2RP2000616 
NT2RP2001388 
NT2RP2001878 
NT2RP2002232 
NT2RP2002721 
NT2RP2003390 
NT2RP2004069 
NT2RP2004556 
NT2RP2005027 
NT2RP2005535 

NT2RP3000011 
NT2RP3000645 
NT2RP3000838 
NT2RP3001388 
NT2RP3002342 
NT2RP3002738 
NT2RP3003473 
NT2RP3004067 
NT2RP3004133 
NT2RP3004625 
NT2RP4001009 
NT2RP4002715 
OVARC1 000298 
OVARC1 000775 
OVARC1001030 



OVARC1 
PLACE1 
PLACE1 
PLACE1 
PLACE1 
PLACE1 
PLACE1 
PLACE1 
PLACE1 
PLACE1 
PLACE1 
PLACE1 
PLACE1 
PLACE1 
PLACE1 



001596 
000442 
001123 
001795 
002518 
003163 
003772 
0041 99 
004637 
005410 
005736 
006219 
007296 
008657 
010713 
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PLACE1011116 S PLACE1011181, 

PLACE1011236, PLACE1011516, PLACE1011708, PUCE1011824 PLACE1011978 PLACE200011H 

WRoTooTl 8 2 1 8 THYROI ooi ?n^' T^™^ TH^oS, TH^foZ .' 

™yro omlS' ^uvo^ ^ 205 ' TH^ 01001242 - THYRO1001495, THYRO1001523, THYRO1001529 
™ Y A R ° 001593 ' THYRO1001608, THYRO1001702, THYRO1001725. THYRO1001770 THYRO1001803 
Y79AA1000117, Y79AA1 000207, Y79AA1 000226. Y79AA100O270, Y79AA1000426 Y79AA1 000777' 
SSE £52" : 7 79AA1000888 - V79AA1000959. Y79AA1001013, Y79AA1001oX Y79AA100109o' 
Y79AA1001264, Y79AA1001272, Y79AA1001328, Y79AA1001427, Y79AA1001430 Y79AA 00 530 
Y79AA1001592, Y79AA1001727, Y79AA1001793, Y79AA1001799 Y79M100 863 Y79M oS 
Y79AA1002213,Y79AA1002373 : Y79AA1002376,Y79AA1002381 Y79AA1002022, 
[0216] Signal ratios of EC_AGE_BSA to EC_BSA and of EC_gly Ca ted_BSA to EC_BSA were calculated for each 
gene. Genes w,th h,gh signal ratios were selected. In the case of calculating the ratio of signal value of 40 or less to 

EC apTr^po Va ' UeS Were ' f ° r convenience . tal ™ a * 40 instead of the real values. When the ratio 

ZZ h T , f ~ * ' S ° r m ° re - ex P ression of «"e genes exhibiting such ratio is expected to be elevated due to 
advanced glycat.on endproduct of bovine serum albumin. The higher the value is. the higher the gene expression level 

oeTdT* r ra,i ff ° e ^h ge - b i a/ec - bsa ranges fr ° m °- s ,o 2 - ex P ressi ° n * '*» °— «nSS ssit; 

Pr S ^ f edduetoadvanced 9^ionendproductofbovineserumalbumin.Whenthera.k>ECAGE BSA/ 
EC_BSA ,s less than 0.5, expression of the genes exhibiting such ratio value is expected to be decreased due to 

^ C yCat,On . endpr0dUCt ° f b ° Vine Semm albumin • The lower the value is - tha '°vver the gene expression level is 
Y79M1 000967 EC-AG E_BSA/BC_BSA ratio of 2 or higher are as follows: NT2RP2001 5^8, NT2RP4001 OoCd 

[0218] These cDNAs are associated with diabetes. 
Analysis of genes associated with neural cell differentiation 

n?nlcL it h eneS inVOlVed in neUra ' Ce " differen,ia,ion are U! *ful for treating neurological diseases. It is possible that 
wTh neriog^^s:"' 0 " V6,S reSP ° nSe 10 indUCti ° n ° f Ce " Ular differentiati ° n neUral ce,,S ™ associa * d 

[0220] A survey was performed for genes of which expression levels are varied in response to induction of differen- 
tiation (stimulation by retinoic acid (RA)) in cultured cells of a neural strain NT2 

mfn! J? 06,18 Were , ,reated basical * accordin 9 *° supplier's instruction manual. -Undifferentiated NT2 cells- 
means NT2 cells successively cultured in an Opti-MEM I (GIBCO-BRL; catalog No. 31 985) containing 1 0%(v/v) fetal 
bovine serum and ^(v/v) penicillin-streptomycin (GIBCO BRL). "NT2 ceils cultured in the presence of retinote «£ 

cTnsJts of dTeZ?^!?™™! 7*2 i3ted NT2 Ce " S int ° a r6tin0iC aci ^n-ng medium, which 

10 M rill h n !; Ca,a, ° 9 N °- 1 1 965) ' 1 ° %(V/V) ,e,al bovine serum ' 1 % < v/v ) Penicillin-streptomycin and 

10*1 retinoic acd (GIBCO-BRL), and the subsequent successive culture therein for 5 weeks. "NT2 cells that were 

N U T2 Sri.!,Cf ° f ret ! n0fc aCld and ,h6n ,Urth6r CUltUred in ,ne presence of ^"-division inhibitor added" means 
NT2 cells resulted from transferring NT2 cells cultured in the presence of retinoic acid for 5 weeks into a cell-division 
nh bitor-contaimng medium, which consisted of D-MEM(GIBCO BRL; catalog No.1 1 965), 10%(v/v) fetal bovine serum 
%(v/v) ^^streptomycin, 10. M retinoic acid, 10.M FudR (5-f.uoro-l-deoxyun L: 2 
I 'o^ee^aTh V2 ( C ^ Sine - D - Arabi -'- a -ide: GIBCO BRL), and the subsequence success^ 

TZL l c k, A °™ he Ce " S Were ,reated Wtth trypsin and tnen hfUVested - Total RNAs were extracted from 

the cells by using S.N.A.P.C™) Total RNA Isolation kit (Invitrogen). The labeling of probe used for hybridization was 

innate S Tl °?? * ** » ab0Ve - The data 

n t nphcate (n=3). The data of signal value representing gene expression level in the cells in the presence of stimulation 

Lv 2E2i T ? '°r r re C ° mPared W " h th ° Se " th6 abSGnCe ° f the S,imula,ion - The co -P a -on was perfoS 
unde the condition of p<0.05. In th.s analysis, clones with the difference can be statistically detected even when the 

° W < Accordin 9'y' clones with s '9" a ' value of 40 or .ess were also assessed for the selection 
[0222] Tables 1 86-365 show the expression level of each cDNA in undifferentiated NT2 cells, NT2 cells cultured in 
the presence o RA, and NT2 cells that were cuftured in the presence of RA and that were further ^clrfd in the 
presence of cell-division inhibitor added. eu in me 

S. AV T a . 9ed V . a ' UeS (M " Ma) and Samp,e variances < S 1 2 ' s 2 2 > were calculated for each gene in each of 
he cells, and then the pooled sample variances s* were obtained from the sample variances of the two typLs of ceis 
to be compared The t values were determined according to the following formula: t=(M1-M2)/s/(1/3 + 1/3)'A> When lhe 

?n1h^dr s tHi Ue , Wa f 9re ,f;^ han 8 , ' Va ' Ue a ' P ' WhiCh meanS the Pr0babi,itv of s'9nificance level, of 0.05 or 0.01 
he Lnt SS , *5 T 4 d69reeS ° f freed ° m ' the differenCe was j udaed 10 be found ^ the expression level of 
the gene between the two types of ce.ls at p<0.05 or p<0.01 , respective*. The tables also inc.ude the information on 
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an increase (+) or decrease (-) in the expression level of a gene in the treated cells when the level is compared with 
that of untreated undifferentiated cells. 

[0224] Clones of which expression levels increased by RA are as follows: HEMBA1 000121, HEMBA1000275 



HEMBA1 000300, 

HEMBA1001886, 

HEMBA1003120, 

H EM BA 1004444, 

HEMBA1 006299, 

HEMBA1 007063, 

HEMBB1 000668, 

HEMBB1 002693, 

MAM MA 1000881, 

NT2RM2000497, 

NT2RM4000593, 

NT2RP1 000465, 

NT2RP2000610, 

NT2RP2001878, 

NT2RP2002974, 

NT2RP2003664, 

NT2RP2004670, 

NT2RP2005535, 

NT2RP2006099, 

NT2RP3000460, 

NT2RP3001044, 

NT2RP3001542, 

NT2RP3003000, 

NT2RP3004025, 

NT2RP3004640, 

OVARC1 000208, 

OVARC1000916, 

OVARC1002178, 

PLACE1001123, 

PLACE1 003428, 

PLACE1 005669, 

PLACE1 006208, 

PLACE1010445, 

THYR01 000580, 

THYRO1001327, 

Y79AA1 000270, 



HEMBA1 000634, 
HEMBA1002163 : 
HEMBA1 003294, 
HEMBA1005230, 
H EM BA1 006357, 
HEMBA1 007241, 
HEMBB1001026, 
MAMMA1000106 : 
MAMMA1001634, 
NT2RM2000582, 
NT2RM4001321, 
NT2RP 1000468, 
NT2RP2000663, 
NT2RP2001015, 
NT2RP2002976, 
NT2RP2003940, 
NT2RP2005069, 
NT2RP2005541 , 
NT2RP2006134, 
NT2RP3000481, 
NT2RP3001061, 
NT2RP3001592, 
NT2RP3003354, 
NT2RP3004083, 
NT2RP4000108, 
OVARC 1000275, 
OVARC1 000995, 
PLACE1 000258, 
PLACE1001795, 
PLACE1 003644, 
PLACE 1005682, 
PLACE 1007296, 
PLACE1011708, 
THYROI 000776, 
THYRO1001523, 
Y79AA1001056, 



HEMBA1000671, HEMBA1000875, 

HEMBA1 002227, HEMBA1 002420, 

HEMBA1 003497, HEMBA1 004007, 

H EM BA 1 005246 , H E M BA 1 005267, 

HEMBA1 006517, HEMBA1006544, 

HEMBB1 000447, HEMBB1000542, 

HEMBB1 001 847, H EMBB1 002051 , 

MAMMA1 000141, MAMMA1 000473, 

MAMMA1001957, MAMMA1 002205, 

NT2RM2001126, NT2RM2001902, 



NT2RP1 000002, 
NT2RP1 000579, 
NT2RP2000694, 
NT2RP2002304, 
NT2RP2003179, 
NT2RP2004069, 
NT2RP2005247, 
NT2RP2005774, 
NT2RP3000011, 
NT2RP3000652, 
NT2RP3001170, 
NT2RP3001976, 
NT2RP3003532, 
NT2RP3004090, 
NT2RP4000634, 
OVARC1 000553, 
OVARC1001030, 
PLACE 1000442, 
PLACE1 002518, 
PLACE1 003839, 
PLACE1 005736, 
PLACE1 007626, 
PLACE1011978, 
THYR01 000999, 
THYRO1001725, 
Y79AA1001062, 



NT2RP1 000050, 
NT2RP1 000679, 
NT2RP2000903, 
NT2RP2002721, 
NT2RP2003302, 
NT2RP2004108, 
NT2RP2005425, 
NT2RP2005878, 
NT2RP3000125, 
NT2RP3000677, 
NT2RP3001240, 
NT2RP3002790, 
NT2RP3003729, 
NT2RP3004130, 
NT2RP4002451, 
OVARC1000775 : 
OVARC1001049, 
PLACE1 000927, 
PLACE1 002547, 
PLACE 1004078, 
PLACE 1005768, 
PLACE 1008282, 
PLACE4000455, 
THYRO1001063, 
THYRO1001770, 
Y79AA1001090, 



HEMBA1001184 
HEMBA1002421 
HEMBA1004110 
HEMBA1005489 
HEMBA1006658 
H EM BB1 000567 
HEMBB1002120 
MAMMA1 000528 
MAM MA1 002224, 
NT2RM4000198 
NT2RP1000181 
NT2RP2000092 
NT2RP2001388 
NT2RP2002824 
NT2RP2003383 
NT2RP2004524 
NT2RP2005463 
NT2RP2005883 
NT2RP3000171 
NT2RP3000818 
NT2RP3001322 
NT2RP3002900 
NT2RP3003874 
NT2RP3004202 
NT2RP4002715, 
OVARC1 000853 
OVARC1001132 
FLACE1 000986 
PLACE1 002967 
PLACE1 004441 
PLACE 1005815 
PLACE 1008984 
SKNMC1 000004, 
THYRO1001128 
Y79AA1 000207, 
Y79AA1001727 



HEMBA1001390 
HEMBA1 003072 
HEMBA1 004391 
HEMBA1005913 
HEMBA1 006749 
HEMBB1 000642 
HEMBB1 002228 
MAMMA1 000810 
NT2RM2000423 
NT2RM4000284 
NT2RP1 000261 
NT2RP2000479 
NT2RP2001538 
NT2RP2002942 
NT2RP2003469 
NT2RP2004556 
NT2RP2005514 
NT2RP2005887 
NT2RP3000232 
NT2RP3000820 
NT2RP3001388 
NT2RP3002983 
NT2RP3003939 
NT2RP3004294 
OVARC1 000090 
OVARC1 000873 
OVARC1001596 
PLACE1001100 
PLACE1 003407 
PLACE1 004450 
PLACE1 006073 
PLACE1 008985 
THYR01 000036 
THYRO1001205 
Y79AA1 000226 
Y79AA1002213 



Y79AA1 002381. 

[0225] Clones of which expression levels decreased by RA are as follows: BNGH41 000020, HEMBA1 005070, 
NT2RP2005027, NT2RP3003473, Y79AA1 002376. 

[0226] Clones of which expression levels increase by RA/inhibitor are as follows: 
HEMBA1000128, HEMBA1 000875, HEMBA1001390, HEMBA1002163, HEMBA1 002227, 



H EM BA 1004391, 

HEMBA1 006335, 

MAMMA1001344, 

NT2RP2001388, 

NT2RP2004837, 

NT2RP2005887, 

NT2RP3002721, 

NT2RP3004294, 

OVARC1 000467, 

OVARC1001030, 

PLACE1001123, 

PLACE1 003407, 

PLACE1 005601, 

PLACE1006716, 



H EM BA1 004454, 
HEMBA1 006544, 
NT2RM2000582, 
NT2RP2002674, 
NT2RP2005069, 
NT2RP3000460, 
NT2RP3002738, 
NT2RP3004640, 
OVARC 1000553, 
OVARC1001222, 
PLACE1001407, 
PLACE1 003644, 
PLACE 1005725, 
PLACE1 007296, 



HEMBA1 004785, 
HEMBA1 007241, 
NT2RP1001004, 
NT2RP2002974, 
NT2RP2005425, 
NT2RP3000838, 
NT2RP3003469, 
NT2RP4000108, 
OVARC1 000853, 
OVARC1 001596, 
PLACE1 001464, 
PLACE1 003845, 
PLACE1 005736, 
PLACE1 007626, 



HEMBA1005913, 
HEMBB1000447, 
NT2RP2000663, 
NT2RP2003383, 
NT2RP2005463, 
NT2RP3001044, 
NT2RP3004083, 
NT2RP4002451, 
OVARC1 000873, 
OVARC1 002058, 
PLACE1001564, 
PLACE 1004441, 
PLACE1 006093, 
PLACE1 008359, 



HEMBA1006171, 
HEMBB1 000668, 
NT2RP2000694, 
NT2RP2004069, 
NT2RP2005541, 
NT2RP3001240, 
NT2RP3004130, 
NT2RP4002715, 
OVARC1000916, 
OVARC1002178, 
PLACE1001795, 
PLACE1 004482, 
PLACE1006219, 
PLACE1010968, 



HEMBA1 002421, 
HEMBA1 006299, 
MAMMA1 000994, 
NT2RP2000903, 
NT2RP2004606, 
NT2RP2005883, 
NT2RP3001388, 
NT2RP3004202, 
OVARC1 000275, 
OVARC1 000995, 
PLACE1 000927, 
PLACE1 002547, 
PLACE1 005410, 
PLACE1 006290, 
PLACE1011364, 
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PLACE1011824, THYRO1000678, THYRO1000776, THYRO1000999, THYRO1001113 THYRO1001?^7 

THYR01 001 523, Y79AA1 000226, Y79AA1 000888. Y79AA1 001 430. »^HO1001113. THYRO1001237, 

E2a«™«™, ° , LJ * hich ex P r ession levels decrease by RA/inhibitor are as follows: HEMBA1000349 

HEMBA1001297, HEMBA1001878, HEMBA1005070, HEMBA1006482, HEMBB1001959 NT2RM2001939 

NT2RP2001469 ' "T2RP3003473. OVARC1001132,' PLACETS; SSS 

2,,^°' * hiCh eXDression levels incre ** in the presence of both RA and RA/inhibitor are as follows- 
HEMBA1000875, HEMBA1001390, HEMBA1002163, HEMBA1002227 HEMBA1002421 HEMBaImSS" 
HEMBA1005913. HEMBA1006299, HEMBA1 006544, HEMBA1007241,' HmlZTZj, HEMBB1000668.' 



NT2RM2000582, 
NT2RP2003383, 
NT2RP2005883, 
N12RP3004083, 
NT2RP4002451, 
OVARC1000916, 
PLACE1001123, 
PLACE1 005736, 



NT2RP2000663, 
NT2RP2004069, 
NT2RP2005887, 
NT2RP3004130, 
NT2RP4002715, 
OVARC1000995 : 
PLACE1001795, 
PLACE 1007296, 



NT2RP2000694, 
NT2RP2005069, 
NT2RP3000460, 
NT2RP3004202, 
OVARC1 000275, 
OVARC1001030, 
PLACE1 002547, 
PLACE1 007626, 



NT2RP2000903, 
NT2RP2005425, 
NT2RP3001044, 
NT2RP3004294, 
OVARC1 000553, 
OVARC1001596, 
PLACE1 003407, 
THYR01 000776, 



NT2RP2001388, 
NT2RP2005463, 
NT2RP3001240, 
NT2RP3004640, 
OVARC1 000853, 
OVARC1002178, 
PLACE1 003644, 
THYR01 000999, 



NT2RP2002974, 
NT2RP2005541 , 
IMT2RP3001388, 
NT2RP4000108, 
OVARC1 000873, 
PLACE 1000927, 
PLACE1 004441, 
THYRO1001523, 



Y79AA1 000226. 

^^l^lSiSSSS^ decrease in ,he presence of b0,h RA and RA/inhibitor are as foll °- 

[0230] These are neurological disease-associated clones. 
Analysis of rheumatoid arthritis-associated genes 

surfaces ^Zl^TT^T^ * * be inV °' Ved h the P rolif ^tion of synovia, cells covering inner 

surfaces of o nt cav,ty and ,n inflammatory reaction resulted from the action of cytokines produced by leukocytes 
,nf,ltrat.n gi ntothejo,nts^^^^^ 

1 ^Twtnt WSSUe ™T ,aCt ° r (TNFK Part,CiPateS in ,he ° nSet < CU ™< OP™" in J luXT 9 ?9 
11 , 657-662)^ When the express.on of a gene exhibits responsiveness to the action of TNF on synovial cells the qene 

is considered to be involved in rheumatoid arthritis. 9 
[0232] A survey was performed for genes of which expression levels are varied in response to TNF- in the primary 
cell culture of synovial tissue. The primary cultured cells of the smooth muscle (Cell Applications) were grown o be 
confluent ,n a culture dish, and then, human TNF, (Boehringer-Mannheim) was added at a final concentZn of W 
ng/ml thereto. The culture was further continued for 24 hours concentration ot 1 0 

[0233] Total RNA was extracted from the cells by using S.N.A.P.C™) Total RNA Isolation kit (Invitrogen) The labelina 
iHT: f ° r T h h y ^ di2a,ion was carried «* ^ using io. 9 of the total RNA according to th ^ame metfS £ 
descnbed above. The data were obtained in tripricate (n=3). The data of signal value representing gene expression 
level ,n the cells ,n the presence of TNF stimulation were compared with those in the absence o the stimufcJon The 
comparison was performed by statistical treatment of two-sample t-test. Cones wrth signtficant J^JSESZ 
d.tnbution were selected under the condition of p<0.05. In this analysis, clones with the difference can Z statS 

Si9na ' S ^ Wi,h Si9na ' Va,Ue of 40 or less — *o asseLS 

[0234] Table 366 shows the expression level of each cDNA in synovial cells cultured in the absence or presence of 

Els tndTn fh 9nal 7T (M :- Mz) Sa T P,e V8rianCeS ^ for each W were calculated in each of 
he cells, and then the pooled sample variances s* were obtained from the sample variances of the two types of cells 

deten^Z 8 , , ^'^ detefmined * ,he f °"° Win9 f <»™ ,a: t=(M 1 -M 2 )/s/(1/3 + 1/3)^ Whence 

T , Wa f 9r6 f " ,ha " 8 t " Va ' Ue at P ' Whfch meanS ,he P r0babili, y ° f 'eve», of 0.05 or 0 ot 

he oenfh f ,? t "? * d69reeS °' freed0m ' * e difference Was > U(S ^ to be fou " d "> the expression level of 

Zu^lZ'Zl^ ,n he express,on ,evel of a gene in ,he stimulated ce,,s when the level is ""p— 

[0236] Clones of which expression levels are elevated by TNF- are as follows- 

BNGH41000020, HEMBA1000349, HEMBA1000634, HEMBA1000671, HEMBA1000835 HEMRAmnnow 

HEMBA1002178, HEMBA1002195, HEMBA1002239, HEMBA1002420 HEMBA 002524 HEMBA1 002QQ?' 

HEMBA1003315, HEMBA1003392, HEMBA1003487, HEMBA1003602 HdX uS' hIZmZIIt 

HEMBA1005337, HEMBA1005489, HEMBA1006916, HEMBB1000668 HEMBB1 000905 ' H^MBB Zll? 

HEMBB1001573, HEMBB1002041, HEMBB1002663, MAMMA1000652, MAMMA1000810 MAMMA1 6oi 634' 
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MAMMA1 002091 , 
NT2RP1 000740, 
NT2RP2005163, 
NT2RP3001976, 
NT2RP3004647, 
OVARC1 000275, 
PLACE1001100, 
PLACE1 003644, 
PLACE1 006809, 
Y79AA1 000270, 



MAMMA1 002234.. 
NT2RP2000694, 
NT2RP2005463, 
NT2RP3003473, 
NT2RP4000108, 
OVARC1 000439, 
PLACE1 001464, 
PLACE1004519, 
PLACE1008716, 
Y79AA1 000876, 



NT2RM2000306, 
NT2RP2001921, 
NT2RP2006134, 
NT2RP3003874, 
NT2RP4001001, 
OVARC1001607, 
PLACE1001500, 
PLACE1 005031, 
PLACE1010081, 
Y79AA1001013, 



NT2RM4000417, 
NT2RP2002527, 
NT2RP3000171, 
NT2RP3004090, 
NT2RP4001877, 

PLACE1 000740, 
PLACE1001918, 
PLACE1 005410, 
THYRO1001770, 
Y79AA1001264, 



15 



Y79AA1001430, Y79AA1001530, Y79AA1001799. 

[0237] Clones of which expression levels decrease by TNF-. are as follows; 
NT2RM4000326, NT2RP1000300, NT2RP2000514, NT2RP2001755, 
NT2RP3002790. These are rheumatoid arthritis-associated clones. 

EXAMPLE 16 



NT2RP1 000002, 
NT2RP2004495, 
NT2RP3000652, 
NT2RP3004294, 
OVARC1 000090, 
PLACE 1000927, 
PLACE1 002095, 
PLACE1 005736, 
Y79AA1000127, 
Y79AA1001272, 



NT2RP1000181, 
NT2RP2004606, 
NT2RP3001195, 
NT2RP3004557, 
OVARC1000105, 
PLACE1001016, 
PLACE1 002547, 
PLACE1006219, 
Y79AA1 000207, 
Y79AA1001328, 



NT2RP2006042, NT2RP3000481 , 



Search for a signal sequence, transmembrane region and functional domain in deduced amino acid sequences 

20 [0238] The deduced amino acid sequences from the full-length nucleotide sequences were examined to predict the 
presence of a signal sequence in their ami no-termini as well as the presence of a transmembrane region. The amino 
acid sequences were also searched for a protein functional domain (motif). The examinations for a signal sequence 
in the amino-terminus, for a transmembrane region and for a functional domain were performed by using PSORT [K. 
Nakai & M. Kanehisa, Genomics, 14:897-911 (1992)], SOSUI [T. Hirokawa et al., Bioinformatics, 14:378-379 (1998)] 

25 (Mitsui Knowledge Industry Co., Ltd.) and Pfam (http://www.sanger.ac.uk/Software/Pfam/index.shtml ), respectively. 
When the presence of a signal sequence or a transmembrane region in the amino-terminus was predicted in the amino 
acid sequence by PSORT or SOSUI, the protein was predicted to be a secretory protein or a transmembrane protein. 
When the amino acid sequence matched a functional domain in the Pfam search for a functional domain, the function 
of the protein is predictable based on the matching data, for example, by referring to the functional categories in 

30 PROSITE (http://www.expasy.ch/cgi-bin/prosite-list.pl ). The functional domain search can be performed by using 
PROSITE instead of Pfam. 

[0239] Search results obtained by using the respective software programs are indicated below. 

[0240] Clones whose deduced amino acid sequences were predicted to have signal sequences by PSORT search 

are as follows: 

35 HEMBA1000713, HEMBA1002420, HEMBA1002421 , HEMBA1003101 , HEMBA1004110, HEMBA1006707, 
HEMBA1006902, HEMBB1001530, HEMBB1001573, HEMBB1001978, HEMBB1002162, HEMBB1002245, 
HEMBB1002427, MAMMA1000102, MAMMA1 000118, MAMMA1000457, MAMMA1 001043, MAMMA1001 344, 
MAMMA1001893, MAMMA1 002070, MAMMA1002165, MAMMA1 002633, NT2RM2000241 , NT2RM2000410, 
NT2RM2001941, NT2RM4001325, NT2RP1001563, NT2RP2001495, NT2RP2002063, NT2RP2002721 , 

40 NT2RP2003383, NT2RP2003593, NT2RP2003655, NT2RP2003664, NT2RP2004179, NT2RP2004205, 
NT2RP2004524, NT2RP2005463, NT2RP3000460, NT2RP3001012, NT2RP3001858, NT2RP3002836, 
NT2RP3003076, NT2RP3003532, NT2RP3004133, NT2RP3004309, NT2RP4001467, NT2RP4002451 , 
OVARC1000298, OVARC1000811 , PLACE1000231 , PLACE1000740, PLACE1001183, PLACE1001536, 
PLACE1001564, PLACE1 002095, PLACE1 002374, PLACE1 003839, PLACE1 001482, PLACE1 005005, 

45 PLACE1 005250, PLACE1 005383, PLACE1 005410, PLACE1 005544, PLACE1 005569, PLACE1 006093, 
PLACE1006277, PLACE1006809, PLACE1007626, PLACE1008359, PLACE1009067, PLACE1010251 , 
PLACE1011236, SKNMC1000004, SKNMC1000014, THYRO1000099, THYRO1000196, THYRO1001237, 
THYRO1001327, THYRO1001523, THYRO1001702, THYRO1001725, Y79AA1 000426, Y79AA1000521, 
Y79AA1 000959, Y79AA1001013, Y79AA1001264, Y79AA1001328, Y79AA1001427, Y79AA1001430, 

50 Y79AA1 001 795, Y79AA1 001 803, Y79AA1 002022, 

[0241] Clones whose deduced amino acid sequences were predicted to have transmembrane regions by SOSUI 
search are as follows: BNGH41000091, HEMBA1000121 , HEMBA1000349, HEMBA1000477, HEMBA1000713, 
HEMBA1000940, HEMBA1000962, HEMBA1001221 , HEMBA1001228, HEMBA1001621 , HEMBA1002167, 
HEMBA1002195, HEMBA1002227, HEMBA1002421 , HEMBA1003101 , HEMBA1003392, HEMBA1003530, 

55 HEMBA1003732, HEMBA1003945, HEMBA1004391 , HEMBA1004454, HEMBA1004797, HEMBA1004982, 
HEMBA1005449, HEMBA1005522 ; HEMBA1 005545, HEMBA1005698, HEMBA1006171 , HEMBA1006299, 
HEMBA1006311, HEMBA1006335, HEMBA1006357, HEMBA1006430, HEMBA1006724, HEMBA1006960, 
HEMBB1000407, HEMBB1000447, HEMBB1 000567, HEMBB1000679, HEMBB1000905, HEMBB1001 026, 
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HEMBB1001407, HEMBB1001573, HEMBB1001978, HEMBB1002Q41, HEMBB1002162 HEMBB1002245 

HEMBB1 002427, HEMBB 1002693, MAMMA1000102, MAMMA1000106, MAMMA1000118,' MAMMA1 000141* 

MAMMA1000204, MAMMA1 000226, MAMMA1000457, MAM MA1 000473, MAMMA1 000591 , MAMMA1000681 ' 

MAMMA1000810, MAMMA1000986, MAMMA1001043, MAMMA1001141 , MAMMA1001237 MAMMA1001344' 

MAMMA1001893, MAMMA1001957, MAMMA1001978, MAMMA1002070, MAMMA1002091 , MAMMA1002095' 

MAMMA1002633, NT2RM1000580, NT2RM1000855, NT2RM1000858, NT2RM2000410" NT2RM200056s' 

NT2RM2001626, NT2RM2001939, NT2RM2001941, NT2RM4000444, NT2RM4G00587' 

NT2RM4000997, NT2RM4001325, NT2RM4001735, NT2RM4001768, MT2RM4002352 

NT2RP1000181, NT2RP1000261, NT2RP1 000300, NT2RP1 000448, NT2RP1 000551 ' 
NT2RP1000981, NT2RP1001563, NT2RP2000479, NT2RP2000533, NT2RP2000649 



NT2RM4000648, 
NT2RP1 000050, 
NT2RP1 000613, 



NT2RP2000663 
NT2RP2001495 
NT2RP2002533 
NT2RP2003390 
NT2RP20041 79 
NT2RP2004648 
NT2RP2005666 
NT2RP2006580 l 
NT2RP3000721, 
NT2RP3001240, 
NT2RP3001650, 
NT2RP3002790, 
NT2FP3003532, 
NT2RP3004090, 
NT2RP3004481, 
NT2RP4002187, 



NT2RP2000694, 
NT2RP2001915, 
NT2RP2002721, 
NT2RP2003469, 
NT2RP2004205, 
NT2RP2004794, 
NT2RP2005883, 
NT2RP3000169, 
NT2RP3000818, 
NT2RP3001271, 
NT2RP3001738, 
NT2RP3002836, 
NT2RP3003535 : 
NT2RP3004130, 
NT2RP3004552, 
NT2RP4002451 . 



NT2RP2000818, 
NT2RP2001956, 
NT2RP2002824, 
NT2RP2003593, 
NT2RP2004495, 
NT2RP2005163, 
NT2RP2005994, 
NT2RP30001 71 , 
NT2RP3000907, 
hfT2RP3001322, 
NT2RP3002015, 
NT2RP3002900, 
NT2RP3003614.. 
NT2RP3004294, 
NT2RP4001001, 
NT2RP4002750, 



NT2RP2000903, 
NT2RP2002188, 
NT2RP2002942, 
NT2RP2003655, 
NT2RP2004524, 
NT2RP2005181, 
NT2RP2006004, 
NT2RP3000304, 
NT2RP3000921 , 
NT2RP3001388, 
NT2RP3002311, 
NT2RP3002958, 
NT2RP3004025 5 
NT2RP3004309, 
NT2RP4001009, 
OVARC1 000003, 



OVARC1 000439, OVARC1 000553, OVARC1001030, OVARC1001336, 



OVARC1 001570 
PLACE1 000927 
PLACE1001464 
PLACE1001949 
PLACE1 003839 
PLACE1004637 i 
PLACE 1 00541 0 ; 
PLACE1 005927, 
PLACE1 007626, 
PLACE1009196, 
PLACE1010078, 
PLACE1011516 ; 
THYRO1001102 
THYRO1001327 
Y79AA1 000207, 
Y79AA1001212, 



PLACE1 000231, 
PLACE1001016, 
PLACE1001536, 
PLACE 100251 8, 
PLACE 1003845, 
PLACE 1004648, 
PLACE1 005544, 
PLACE 1006290, 
PLACE1007881, 
PLACE1 009279, 
PLACE1010445, 
PLACE3000181, 
THYRO1001113, 



PLACE 1000560, 
PLACE1001183, 
PLACE1001564, 
PLACE1 002726, 
PLACE1 003852, 
PLACE1004816, 
PLACE1 005569, 
PLACE1 006443, 
PLACE1 008359, 
PLACE1 009527, 
PLACE1010713, 
THYRQ1 000400, 



PLACE 1000740, 
PLACE1001231, 
PLACE1001655, 
PLACE 1002967, 
PLACE1 004279, 
PLACE1 004887, 
PLACE 1005660, 
PLACE1 006959, 
PLACE 1008469, 
PLACE1 009546, 
PLACE1010784, 



NT2RP2001200, 
NT2RP2002232, 
NT2RP2002976, 
NT2RP2003664, 
NT2RP2004556, 
NT2RP2005463, 
NT2RP2006269, 
NT2RP3000460, 
NT2RP3001159, 
NT2RP3001560, 
NT2RP3002342, 
NT2RP3003000, 
NT2RP3004075, 
NT2RP3004345, 
NT2RP4001467, 
OVARC1000105, 

PLACE1000912, 
PLACE1001401, 
PLACE1001836, 
PLACE1 003573, 
PLACE1 004282, 
PLACE1 005003, 
PLACE1 005725, 
PLACE1 007096, 
PLACE1008716, 
PLACE1 009600, 
PLACE1010968, 
THYRQ1 000776, 



NT2RP2001276, 

NT2RP2002527, 

NT2RP2003042, 

NT2RP2003950, 

NT2RP2004606, 

NT2RP2005597, 

NT2RP2006512, 

NT2RP3000616, 

NT2RP3001195, 

NT2RP3001592, 

NT2RP3002411, 

NT2RP3003354, 

NT2RP3004083, 

NT2RP3004406, 

NT2RP4001879, 

OVARC1 000307, 

PLACE1000914, 
PLACE1001407, 
PLACE1001918, 
PLACE 1003737, 
PLACE 1004441, 
PLACE1 005005, 
PLACE1 005745, 
PLACE1 007296, 
PLACE1 008985, 
PLACE1010011, 
PLACE1011236, 
THYRQ1 000956, 



THYR0 1000678, 

THYR01 001 205, THYR01 001 237, THYR01 001 242, THYR01 00 1 266 
THYRO1001478, THYRO1001523, THYRO1001641 . THYRO1001702, THYRO1001725' 
Y79AA1 000226, Y79AA1 000270, Y79AA1 000521 . Y79AA1 000888, Y79AA1 001 013' 
Y79AA1001264, Y79AA1001328, Y79AA1001426, Y79AA1001427, Y79AA1001 727' 
Y79AA1 001 787, Y79AA1 001 795, Y79AA1 001 803, Y79AA1 002058. 
Y79AA1 0021 29, Y79AA1 00221 3, Y79AA1 002373, 

[0242J Names of clones whose deduced amino acid sequences were predicted to have functional domains by Pfam 
search, and names of the matched functional domains are shown below. When multiple functional domains matched 
a clone, each domain name was indicated, separated by a double-slash mark,//. 

HEMBA1000006//Src homology domain 3 
HEMBA1000128//SCP-like extracellular Proteins 
HEMBA1000349//ABC transporters 

HEMBA1000462//RNA recognition motif, (aka RRM, RBD. or RNP domain) 
HEMBA1000590//EGF-like domain//von Willebrand factor type A domain 
HEMBA1 000671//Zinc finger, C2H2 type 
HEMBA1000732//EGF-like domain 
HEMBA1 000940//Connexin 

HEMBA1001221//EGF-Iike domain//Kazal-type serine protease inhibitor domain 
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HEMBA1001621//7 transmembrane receptor (rhodopsin family) 

HEMBA1001878//WD domain, G-beta repeats 

HEMBA1002048//Zinc finger, C2H2 type 

HEMBA1002167//Carboxylesterases 
5 HEMBA1002551//WD domain, G-beta repeats 

HEMBA1002992//Ubiquitin family 

HEMBA1003047//CUB domain 

HEMBA1003120//Zinc finger, C2H2 type 

HEMBA1003230//EGF-like domain 
10 HEMBA1003392//Low-density lipoprotein receptor domain class A 

HEMBA1003497//Zinc finger, C2H2 type 

HEMBA1004250//Cadherin 

HEMBA1004391//Fibronectin type III domain//IG superfamily 
HEMBA1004454//4 transmembrane segments integral membrane proteins 
15 HEMBA1004785!/ , chromo* (CHRromatin Organization Modifier) domain 

HEMBA1005246//Zinc finger, C2H2 type 
HEMBA1005267//Ank repeat 

HEMBA1005545//7 transmembrane receptor (rhodopsin family) 
HEMBA1005929//Eukaryotic protein kinase domain 
20 HEMBA1005945//Mitochondrial carrier proteins 

HEMBA1006572//Zinc finger, C2H2 type 

HEMBA1006707//EGF-like domain//von Willebrand factor type A domain 

HEMBAi006749//EGF-like domain//von Willebrand factor type A domain 

HEMBA1006770//RNA recognition motif, (aka RRM, RBD. or RNP domain) 
25 HEMBA1006902//EGF-like domain//von Willebrand factor type A domain 

HEMBB1000106//Zinc finger, CCHC class 

HEMBB1000668//WD domain, G-beta repeats 

HEMBB1000881//Thrombospondin type 1 domain 

HEMBB1000905//WD domain, G-beta repeats 
30 HEMBB1002041//EGF-like domain//Kazal-type serine protease inhibitor domain 

HEMBB1002245//IG superfamily 

HEMBB1002302//Zinc finger, CCHC class 

HEMBB1 002465//Acyl-CoA dehydrogenases 

HEMBB1002661//Helix-loop-helix DNA-binding domain 
35 MAMMA1 000204//C2 domain 

MAMMA1000457//FAD/NAD-binding domain in oxidoreductases 

MAMMA1000681//7 transmembrane receptor (rhodopsin family) 

MAMMA1 00088 1//Eukaryotic protein kinase domain//Protein kinase C terminal domain 

MAMMA1001150//Phorbol esters / diacylglycerol binding domain//Eukaryotic protein kinase domain 
40 MAMMA1 001 31 0//WD domain, G-beta repeats 

MAMMA1001532//Zinc finger, C2H2 type 

MAMMA1001615//Helix-loop-helix DNA-binding domain 

MAMMA1 002070//Kringle domain 

MAMMA1002080//Ras family (contains ATP/GTP binding P-loop) 
45 MAMMA1 002095//E1 -E2 ATPases 

MAMMA1002165//lnsulin-like growth factor binding proteins 

NT2RM1000789//HMG (high mobility group) box 

NT2RM1000855//eubacterial secY protein 

NT2RM1000899//Mitochondrtal carrier proteins 
so NT2RM2000589//PH (pleckstrin homology) domain 

NT2RM2000632//Helicases conserved C-terminal domain 

NT2RM2001792//Fibrinogen beta and gamma chains, C-terminal globular domain 
NT2RM2001902//Eukaryotic protein kinase domain 
NT2RM2001939//7 transmembrane receptor (rhodopsin family) 
55 NT2RM2001 941 ill transmembrane receptor (rhodopsin family) 

NT2RM4000284//Class I Histocompatibility antigen, domains alpha 1 and 2 
NT2RM4000326//2inc finger, C2H2 type 
NT2RM4000417//C2 domain 
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NT2RM4Q00444//ABC transporters 
NT2RM4001377//PH (pleckstrin homology) domain 
NT2RM4001768//A!cohol/other dehydrogenases, short chain type 
NT2RM4002352//Low-density lipoprotein receptor domain class A 
NT2RP1000181//Heme-binding domain in cytochrome b5 and oxidoreductases 
NT2RP1000271//Zinc finger, C2H2 type 
NT2RP1000325//Mitochondrial carrier proteins 
NT2RP1000613//Eukaryotic-type carbonic anhydrases 
NT2RP1000981//IG superfamily 
m"2RP1 001 004//Thrombospondin type 1 domain 
NT2RP1 001 020//Eukaryotic protein kinase domain 
NT2RP1001031//WD domain, G-beta repeats 

NT2RP1001563//EGF-like domain//Lectin C-type domain short and long forms//SCP-like extracellular Proteins 

NT2RP2000092//Zinc finger, C2H2 type 

NT2RP2000514//Fibronectin type Ml domain//IG superfamily 

NT2RP2000649//Zinc-binding metal lop rotease domain 

NT2RP20Q0712//Zinc finger, C2H2 type 

NT2RP2000739//Zinc finger, C2H2 type 

NT2RP2001514//E1-E2 ATPases 

NT2RP2001529//Eukaryotic protein kinase domain 

NT2RP2Q01755//Thrombospondin type 1 domain 

NT2RP2001769//Eukaryotic protein kinase domain 

NT2RP20021 88//Carboxylesterases 

NT2RP2002527//Heme-binding domain in cytochrome b5 and oxidoreductases 

NT2RP2002564//Zinc finger, C2H2 type 

NT2RP2002942//IG superfamily 

NT2RP2003179//Eukaryotic protein kinase domain 

NT2RP2003302//Zinc finger, C2H2 type 

NT2RP2003390//DnaJ s prokaryotic heat shock protein 

NT2RP2003469//Sugar (and other) transporters 

NT2RP2003545//Eukaryotic protein kinase domain 

NT2RP2003593//Thioredoxins 

NT2RP2003940//Zinc finger, C2H2 type 

NT2RP2004108//Zinc finger, C2H2 type 

NT2RP2004205//IG superfamily 

NT2RP2004670//Eukaryotic protein kinase domain 

NT2RP2004847//Zinc finger, C2H2 type 

NT2RP2005181//Amino acid permeases 

NT2RP2005247//Zinc finger, C3HC4 type (RING finger) 

NT2RP2005391//Fibronectin type Ml domain 

NT2RP2005535//Zinc finger, C2H2 type 

NT2RP2005774//Zinc finger, C2H2 type 

NT2RP2005878//Alcohol/other dehydrogenases, short chain type 
NT2RP2005941//Homeobox domain// , Paired box' domain 
NT2RP2006004//Fibronectin type III domain 
NT2RP3000011//WD domain, G-beta repeats 
NT2RP3000022//Eukaryotic protein kinase domain 
IMT2RP3000053//Zinc finger, C2H2 type 
NT2RP3000148//Zinc finger, C2H2 type 
NT2RP3000172//Eukaryotic protein kinase domain 
NT2RP3000201//Eukaryotic protein kinase domain 
NT2RP3000232//Zinc finger, C2H2 type 

NT2RP3000304//Low-density lipoprotein receptor domain class A//Low-density lipoprotein receptor domain class 
B 

NT2RP3000436//Thioredoxins 
NT2RP3000460//eubacterial secY protein 
NT2RP3000616//Fibronectin type III domain 
NT2RP3000652//Zinc finger, C2H2 type 
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NT2RP3000676//Mitochondrial carrier proteins 

NT2RP3000789//KH domain family of RNA binding proteins 

NT2RP3000820//WD domain, G-beta repeats 

NT2RP3000838//PH (pleckstrin homology) domain 
5 NT2RP3000907//E1-E2 ATPases 

NT2RP3000921//IG superfamily 

NT2RP3001195//Sugar (and other) transporters 

NT2RP3001240//eubacterial secY protein 

NT2RP3001388//C2 domain 
10 NT2RP3001 650//CUB domain//Low-density lipoprotein receptor domain class A 

NT2RP3001738//Heme-binding domain in cytochrome b5 and oxidoreductases 

NT2RP3001976//Zinc finger, C2H2 type 

NT2RP3002281//RNA recognition motif, (aka RRM, RBD, or RNP domain) 

NT2RP3002411 //Alcohol/other dehydrogenases, short chain type 
15 NT2RP3002721//Citrate synthase 

NT2RP3003000//lon transport proteins 

NT2RP3003527//Eukaryotic protein kinase domain 

NT2RP3003535//TPR Domain 

NT2RP3003849//C2 domain 
20 NT2RP3004067//Src homology domain 3 

NT2RP3004090//Zinc finger, C3HC4 type (RING finger) 

NT2RP3004481//IG superfamily 

NT2RP3004552//CUB domain//Sushi domain 

NT2RP3004647//Mitochondrial carrier proteins 
25 NT2RP40001 08//lntermediate filament proteins 

NT2RP4000634//Eukaryotic protein kinase domain 

NT2RP4000962//Eukaryotic protein kinase domain 

NT2RP4001009//Zinc-binding metal lop rotease domain 

NT2RP4001877//RNA recognition motif, (aka RRM. RBD, or RNP domain) 
30 NT2RP40021 87//Alcohol/other dehydrogenases, short chain type 

NT2RP4002750//Amino acid permeases 

OVARC1000105//Ubiquitin-conjugating enzymes 

OVARC1000255//Eukaryotic protein kinase domain 

OVARC1 00031 3//Thioredoxins 
35 OVARC1 00041 0//Fibrinogen beta and gamma chains, C-terminal globular domain 

OVARC1000529//Eukaryotic protein kinase domain 

OVARC1 00081 1//CUB domain//Kringle domain 

OVARC1 00091 6//Eukaryotic protein kinase domain 

OVARC1001049//Helix-loop-helix DNA-binding domain 
40 OVARC1 001 338//Eukaryotic protein kinase domain 

OVARC1001569//Eukaryotic protein kinase domain 

OVARC1001570//Eukaryotic aspartyl proteases 

PLACE1 000231 //WAP-type (Whey Acidic Protein) lour-disulfide core 1 

PLACE1000258//Zinc finger, C2H2 type 
45 PLACE1000740//EGF-like domain 

PLACE1000907//Zinc finger, C2H2 type 

PLACE1001016//lon transport proteins 

PLACE1001500//Helicases conserved C-terminal domain 

PLACE1001655//lon transport proteins 
50 PLACE1 001 795//SCP-like extracellular Proteins 

PLACE1001949//E1-E2 ATPases 

PLACE1002329//Src homology domain 3 

PLACE1002355//Alpha-2-macroglobutin family//Kazal-type serine protease inhibitor domain 
PLACE1002374//Cysteine proteases 
55 PLACE1 00251 8//Zinc finger, C3HC4 type (RING finger) 

PLACE1 00291 1//IG superfamily 
PLACE 1 0031 35//Eukaryotic protein kinase domain 
PLACE 1 0031 63//Enoyl-CoA hydratase/isomerase 
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PLACE 1 003573//Lectin C-type domain short and long forms 
PLACE1 0041 66//Bromodomain 

PLACE 1004305//Ras family (contains ATP/GTP binding P-loop) 

PLACE1004441//7 transmembrane receptor (rhodopsin family) 
5 PLACE1004520//IGsuperfamily 

PLACE1 00481 6//Fibrinogen beta and gamma chains, C-terminal globular domain 

PLACE1 004887//Zinc finger, C3HC4 type (RING finger) 

PLACE1 005003//Trypsin 

PLACE1005383//EGF-like domain 
10 PLACE1 00541 0//eubacteria! secY protein 

PLACE1005426//IG superfamily 

PLACE1 00551 9//Eukaryotic protein kinase domain 

PLACE1005539//Heat shock hsp20 proteins 

PLACE1005544//IG superfamily 
1 $ PLACE1 005569//IG superfamily 

PLACE1005682//Zinc finger, C3HC4 type (RING finger) 

PLACE1005736//PH (pleckstrin homology) domain 

PLACE1 006079//Homeobox domain 

PLACE1 00671 6//C1q domain 
20 PLACE 1 008282//Eukaryotic protein kinase domain 

PLACE1 008549//Ets-domain 

PLACE1008744//EGF-like domairV/Sushi domain 

PLACE1009067//Src homology domain 3 

PLACE1010081//Eukaryotic protein kinase domain 
25 PLACE1010251//EGF-like domain 

PLACE1010713//Alcohol/other dehydrogenases, short chain type 

PLACE1 01 0784//7 transmembrane receptor (rhodopsin family) 

PLACE1010968//Fibronectin type II! domain 

PLACE1011181//ATPases associated with various cellular activities (AAA) 
30 PLACE1 01 1 364//Eukaryotic protein kinase domain 

PLACE1011407//Zinc finger, C2H2 type 

PLACE1011708//CUB domain 

PLACE1011824//Eukaryotic protein kinase domain 

PLACE1011978//Zinc finger C2H2 type 
35 PLACE3000181//Cadherin 

PLACE3000213//Sushi domain 

PLACB4000354//BGF-like domain//Sushi domain 

SKNMC1000082//Mitochondrial carrier proteins 

THYR01 0001 96//Cadherin 
^0 THYR01 000580//Zinc finger, C2H2 type 

THYR01 000678//Connexin 

THYR01000795//Mitochondrial carrier proteins 

THYRO1000956//7 transmembrane receptor (rhodopsin family) 

THYR01 001 113//C2 domain 
*5 THYR01 001266//Sodium:solute symporter family 

THYRO1001457//Phorbol esters / diacylglycerol binding domain//Bukaryotic protein kinase domain 
THYR01 001 478//EF hand 

THYRO1001593//Eukaryotic protein kinase domain 
THYR01001700//Eukaryotic protein kinase domain 
50 THYR01 001 770//Eukaryotic protein kinase domain 

Y79AA1000030//WW/rsp5/WWP domain containing proteins 
Y79AA1000426//Transforming growth factor beta like domain 
Y79AA1000777//WD domain, G-beta repeats 
Y79AA1 000876//Thioredoxins 
Y79AA1000967//Eukaryotic protein kinase domain 
Y79AA1001090//Ank repeat 

Y79AA1001264//DnaJ, prokaryotic heat shock protein 
Y79AA1 001 328//EGF-like domain 



55 
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Y79AA1001427//FAD/NAD-binding domain in oxidoreductases 
Y79AA1001523//Bromodomain//Zinc finger, C3HC4 type (RING finger) 
Y79AA1 001 530//Tubulin 
Y79AA1001727//IG superfamily 
5 Y79AA1001787//E1-E2 ATPases 

Y79AA1001799//Mitochondrial carrier proteins 
Y79AA1002022//IG superfamily 
Y79AA1002381//Eukaryotic protein kinase domain 



w EXAMPLE 17 



Functional categories based on the full-length nucleotide sequences 



[0243] Prediction of functions of proteins encoded by the clones and the categorization thereof were performed based 
is on the results of homology search (see homology search result 10) of the databases. GenBank, Swiss-Prot and Uni- 
Gene for the full-length nucleotide sequences of 826 clones as well as based on the results of domain search (see 
Example 16) of the deduced amino acid sequences encoded by the full-length nucleotide sequences. 
(HEMBA1 005337, NT2RM 1000407, NT2RM2001767, and NT2RP3003939 were excluded because of the absence of 
full-length sequence.) 

20 [0244] There are 611 clones that presumably encode proteins belonging to any of categories of secretory and/or 
membrane proteins, glycoprotein -associated proteins, signal transduction-associated proteins, transcription-associat- 
ed proteins and disease-associated proteins. 

[0245] The clones presumably encoding proteins categorized into secretory and/or membrane proteins are those 
which matched the full-length sequences of Swiss-Prot database with keywords "growth factor", "cytokine", "hormone", 

25 "signal", "transmembrane", "membrane", "extracellular matrix", "receptor", "G-protein coupled receptor", "ionic chan- 
nel", "voltage-gated channel", "calcium channel", "cell adhesion", "collagen" or "connective tissue"; those which 
matched the data, suggesting that the proteins are secretory and/or membrane proteins; or those which matched the 
full-length sequences of GenBank or UniGene database with similar description; and, further, those predicted to have 
an N-terminal signal sequence or a transmembrane region as a result of domain search for the amino acid sequences 

30 deduced from the full-length nucleotide sequences. 

[0246] The clones presumably encoding proteins categorized into glycoprotein-associated proteins are those which 
matched the full-length sequences of Swiss-Prot database with the keywords "glycoprotein"; those which matched the 
data, suggesting that the proteins are glycoprotein; or those which matched the full-length sequences of GenBank or 
UniGene database. 

35 [0247] The clones presumably encoding proteins categorized into signal transduction-associated proteins are those 
which matched the full-length sequences of Swiss-Prot database with the keywords "serine/threonine-protein kinase", 
"tyros ine-protein kinase" or "SH3 domain"; those which matched the data, suggesting that the proteins are signal 
transduction-associated proteins (for example, "ADP-ribosylation factor"); or those which matched the the full-length 
sequences of GenBank or UniGene database with similar description. 

40 [0248] The clones presumably encoding proteins categorized into transcription -associated proteins are those which 
matched the full-length sequences of Swiss-Prot database with the keywords "transcription regulation", "zinc finger" 
or "homeobox"; those which matched the data, suggesting that the proteins are transcription-associated proteins; or 
those which matched the full-length sequences of GenBank or UniGene database with similar description. 
[0249] The clones presumably encoding proteins categorized into disease-associated proteins are those which 

45 matched the full-length sequences of Swiss-Prot database with the keywords "disease mutation" or "syndrome"; those 
which matched the data, suggesting that the proteins are disease-associated proteins, or those which matched the 
full-length sequences of Swiss-Prot database and GenBank or UniGene database where the matched sequences are 
those of genes or proteins which had been deposited in the database of Online Mendelian Inheritance in Man (OMIM) 
(http:IAvww.ncbi.nlm.nih.gov/Omim/). which is a database of human genes and diseases. 

so [0250] The following 437 clones were categorized into secretory and/or membrane proteins. 

BNGH41000020, BNGH41000087, BNGH41000091 , HEMBA1000121 , HEMBA1000128, HEMBA1000349, 
HEMBA1000477, HEMBA1000590 : HEMBA1 000713, HEMBA1000732, HEMBA1000745, HEMBA1000835, 
HEMBA1000940, HEMBA1000962, HEM8A1001221 , HEMBA1001228, HEMBA1 001621 , HEMBA1002131 , 
HEMBA1002163, HEMBA1002167, HEMBA1 002178, HEMBA1002195, HEMBA1 002227, HEMBA1002420, 

55 HEMBA1002421, HEMBA1002767, HEMBA1 003047, HEMBA1003101 , HEMBA1 003230, HEMBA1003392, 
HEMBA1003530, HEMBA1003602, HEMBA1003732, HEMBA1003945, HEMBA1004110, HEMBA1004250, 
HEMBA1 004391, HEMBA1 004444, HEMBA1 004454, HEMBA1 004505, HEMBA1 004797, HEMBA1 004982, 
HEMBA1 005070, HEMBA1 005449, HEMBA1 005522, HEMBA1 005545, HEMBA1 005698, HEMBA1 005945, 
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HEMBA1006171, HEMBA1006299, HEMBA1006311 , HEMBA1006335 ; 

HEMBA1006482, HEMBA1006707 : HEMBA1006724, HEMBA1006749, 

HEMBA1007241 , HEMBB1000407 ; HEMBB1000447, HEMBB1000567, 

HEMBB1 001 026, HEMBB1 001 048 : HEMBB1 001 407, HEMBB1 001 53o! 

HEMBB1 001 978, H EMBB1 002041 , HEMBB1 0021 62, H EMBB1 002245, 

MAMMA1000102, MAMMA1000106, MAMMA1000118, MAMMA1000141, 

MAMMA1 000457, MAMMA1 000473, MAM MA1 000496, MAMMA1 000591, 

MAMMA1 000986, MAMMA1 000994, MAMMA1001043, MAMMA1001141 , 

MAMMA1001418, MAMMA1001893, MAMMA1001957, MAMMA1001978, 

MAMMA1 002070, MAMMA1 002091 , MAMMA1 002095, MAMMA1002165, 

MAMMA1 002633, MAMMA1 0031 26, NT2RM1 000462, NT2RM1 000542, 

NT2RM1 000858, NT2RM 1 000899, NT2RM2000241 , NT2RM200041 0, ' 

NT2RM2001 626, NT2RM2001 792, NT2RM2001 939 : NT2RM2001 941 ] 

NT2RM4000417, NT2RM4000444, NT2RM4000587, NT2RM4000593! 



NT2RM4000997, 

NT2RP1 000050, 

NT2RP1 000551, 

NT2RP2000533, 

NT2RP2000903, 

NT2RP2001755, 

NT2RP2002527, 

NT2RP2003042, 

NT2RP2003655, 

NT2RP2004524, 

NT2RP2005163, 



NT2RM4001325, 
NT2RP1000181, 
NT2RP1000613, 
NT2RP2000616, 
NT2RP2001200, 
NT2RP2001915, 
NT2RP2002533, 
NT2RP2003210, 
NT2RP2003664, 
NT2RP2004556, 
NT2RP2005181, 



NT2RM4001735, 
NT2RP1 000261 , 
NT2RP1 000981, 
NT2RP2000649, 
NT2RP2001276, 
NT2RP2001956, 
NT2RP2002721, 
NT2RP2003383, 
NT2RP2003950, 
NT2RP2004606, 
NT2RP2005378, 



NT2RM4001768, 
NT2RP1 000300, 
NT2RP1001004, 
NT2RP2000663, 
NT2RP2001480, 
NT2RP2002063, 
NT2RP2002824, 
NT2RP2003390, 
NT2RP2004179, 
NT2RP2004648, 
NT2RP2005463, 



NT2RP2005666, NT2RP2005883, NT2RP2005994, NT2RP2006004, 



NT2RP2006042 
NT2RP3000304 
NT2RP3000818 ! 
NT2RP3001240, 
NT2RP3001650, 
NT2RP3002342 1 
NT2RP3002958, 
NT2RP3003614, 
NT2RP3004309, 
NT2RP3004647, 
NT2RP4002451, 
OVARC1 000313 
OVARC1 000956 
OVARC1001725 
PLACE1000912, 
PLACE1001231, 



NT2RP2006269, 

NT2RP3000436, 

NT2RP3000907, 

NT2RP3001271, 

NT2RP3001738, 

NT2RP3002411, 

NT2RP3003000, 

NT2RP3004025, 

NT2RP3004345, 

NT2RP4001001, 

NT2RP4002750, 

OVARC1 000410, 

OVARC1001030, 

OVARC1001991, 

PLACE1000914, 

PLACE1001340, 



NT2RP2006512, 
NT2RP3000460, 
NT2RP3000921, 
NT2RP3001322, 
NT2RP3001858, 
NT2RP3002737, 
NT2RP3003076, 
NT2RP3004075, 
NT2RP3004406, 
NT2RP4001009, 
OVARC1 000003, 
OVARC 1000439, 
OVARC1001163, 
PLACE1 000033, 
PLACE1 000927, 
PLACE1 001401 



NT2RP2006580, 
NT2RP3000616, 
NT2RP3001012, 
NT2RP3001388, 
NT2RP3002015, 
NT2RP3002790, 
NT2RP3003354, 
NT2RP3004083, 
NT2RP3004481, 
NT2RP4001467, 
OVARC1000105, 
OVARC1 000553, 
OVARC1001336, 
PLACE 1000231, 
PLACE1001016 : 
PLACE 1001 407, 



PLACE1001536, PLACE1001564, PLACE1001655, PLACE1001 795, 



PLACE1001836 
PLACE1 002374 
PLACE1 002967 
PLACE1 003845 
PLACE 1004482 
PLACE1 005003 
PLACE1 005544 
PLACE1 005927 
PLACE1006443 : 
PLACE1007296 ; 
PLACE1008716, 
PLACE1 009527, 
PU\CE1010251, 
PLACE1011116, 
PLACE3000213, 



PLACE1001918, 
PLACE1002518, 
PLACE 1003407, 
PLACE 1003852, 
PLACE1 004520, 
PLACE 1005005, 
PLACE 1005569, 
PLACE 1006071, 
PLACE1006716, 
PLACE 1007626, 
PLACE1 008744, 
PLACE1 009546, 
PLACE1010445, 
PLACE1011181, 
PLACE4000354, 



PLACE1001949, 
PLACE1 002547, 
PLACE1 003573, 
PLACE1 004279, 
PLACE1 004630, 
PLACE1 005031, 
PLACE1 005660, 
PLACE1 006093, 
PLACE1 006809, 
PLACE1 007845, 
PLACE 1008985, 
PLACE1 009600, 
PLACE1 010713, 
PLACE1011236 ; 
SKNMC1 000004, 



PLACE1 002080, 
PLACE 1002726, 
PLACE 1003737, 
PLACE 1004282, 
PLACE1 004637, 
PLACE 1005383, 
PLACE1 005725, 
PLACE1 006208, 
PLACE 1006959, 
PLACE1 007881, 
PLACE1 009067, 
PLACE1 009982, 
PLACE1010784, 
PLACE1011516, 
SKNMC1 000014, 



HEMBA1 006357, 
HEMBA1 006902, 
HEMBB1 000679, 
HEMBB1001573, 
H EM BB1 002427, 
MAMMA1 000204, 
MAMMA1 000681 . 
MAMMA1001237. 

MAMMA1 002234. 
NT2RM 1000580, 
NT2RM2000423, 
NT2RM4000198, 
NT2RM4000648, 
NT2RM4001843, 
NT2RP1 000325, 
NT2RP1001563, 
NT2RP2000694, 
NT2RP2001495, 
NT2RP2002188, 
NT2RP2002942, 
NT2RP2003469, 
NT2RP2004205, 
NT2RP2004794, 
NT2RP2005541, 

NT2RP3000169, 
NT2RP3000676, 
NT2RP3001159, 
NT2RP3001560, 
NT2RP3002160, 
NT2RP3002836, 
NT2RP3003532, 
NT2RP3004130, 
NT2RP3004552, 
NT2RP4001879, 
OVARC1 000298, 
OVARC1 000811, 
OVARC1 001570, 
PLACE1 000560, 
PLACE1001123, 
PLACE1 001464, 

PLACE1002095, 
PLACE1 002905, 
PLACE1 003772, 
PLACE1 004441, 
PLACE1 004648, 
PLACE1 005410, 
PLACE1005745, 
PLACE 1006277, 
PLACE 1007081, 
PLACE1 008359, 
PLACE1009196, 
PLACE1010011, 
PLACE1010827, 
PLACE1011708, 
SKNMC1 000082, 



HEMBA1 006430, 
HEMBA1 006960, 
HEMBB1 000881, 
HEMBB1001847, 
HEMBB1 002693, 
MAMMA1 000226, 
MAM MA1 0008 10, 
MAMMA1001344, 

MAMMA1 002586, 
NT2RM1 000855, 
NT2RM2000565, 
NT2RM4000284, 
NT2RM4000761 , 
NT2RM4002352, 
NT2RP1 000448, 
NT2RP2000479, 
NT2RP2000818, 
NT2RP2001514, 
NT2RP2002232, 
NT2RP2002976, 
NT2RP2003593, 
NT2RP2004495, 
NT2RP2005027, 
NT2RP2005597, 

NT2RP3000171, 
NT2RP3000721, 
NT2RP3001195, 
NT2RP3001592, 
NT2RP3002311, 
NT2RP3002900, 
NT2RP3003535, 
NT2RP3004133, 
NT2RP3004625, 
NT2RP4002187, 
OVARC1 000307, 
OVARC1 000873, 
OVARC1001607, 
PLACE1 000740, 
PLACE1001183, 
PLACE1001516, 

PLACE1 002355, 
PLACE1 002911, 
PLACE1 003839, 
PLACE1 004450, 
PLACE1004816, 
PLACE1 005426, 
PLACE 1005878, 
PLACE1 006290, 
PLACE1 007096, 
PLACE1 008469, 
PLACE1 009279, 
PLACE101 0078, 
PLACE1010968, 
PLACE3000181, 
THY RQ1 000036, 
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THYR01 000099, THYR01 0001 96, THYR01 000400, THYR01 000584, 
THYRO1000795, THYRO1000956, THYRO1001102, THYRO1001113, 
THYR01 001 205, THYR01 001 237, THYR01 001 242, THYR01 001 266, 

THY R0 1 00 1 523 , TH Y R 0 1 00 1 529 , 
Y79AA1 000226, Y79AA1 000270, 



TH Y R0 1 000678 , THY R0 1 000776 , 



THYROI 001478, 
Y79AA1 000207, 
Y79AA1 000888, 
Y79AA1001426, 
Y79AA1001799, 



Y79AA1 000959, 
Y79AA1001427, 
Y79AA1001803, 



Y79AA1001013, 
Y79AA1 001430, 
Y79AA1 002022, 



THYRO1001641, 
Y79AA1 000426, 
Y79AA1001212, 
Y79AA1001727, 
Y79AA1 002058, 



THYRO1001327, 
THYRO1001702, 
Y79AA 1000521, 
Y79AA1001264, 
Y79AA1001787, 
Y79AA1002129, 



Y79AA1 002373, 

[0251] The following 1 46 clones were categorized into glycoprotein -associated proteins. 



BNGH41 000087, 
HEMBA1001221, 
HEMBA1 002767, 
HEMBA1 004505, 
HEMBA1 006902, 
HEMBB1 002427, 



BNGH41 000091, 
HEMBA1 001228, 
HEM BA1 003230, 
HEMBA1 005449, 
HEMBB1 000679, 
MAMMA1000102. 



MAMMA1 002070, MAM MA 1002586, 
NT2RM2001 81 8, NT2RM2001 939, 



NT2RM4000648, 
NT2RP1001004, 
NT2RP2002533, 
NT2RP2005181, 
NT2RP2006269, 
NT2RP3002737, 
NT2RP3004481, 
OVARC1 000003, 



NT2RM4000997, 
NT2RP2000616, 
NT2RP2003042, 
NT2RP2005541 , 
NT2RP3000304, 
NT2RP3002958, 
NT2RP3004552, 
OVARC1 000553, 



HEMBA1 000349, 
HEMBA1001621, 
HEMBA1 003392, 
HEMBA1 005522, 
HEMBB1 000881, 
MAMMA1 000591, 
MAMMA1003126, 
NT2RM2001 941 , 
NT2RM4001325, 
NT2RP2000694, 
NT2RP2003210, 
NT2RP2005597, 
NT2RP3000616, 
NT2RP3003000, 
NT2RP3004640, 
OVARC1 000811, 



HEMBA1 000590 
HEMBA1002131 
HEMBA1 004250 
HEMBA1 005545 
HEMBB1001048 
MAMMA1 000681 



NT2RM1 000462, NT2RM 1 000580 



NT2RM4000198 

NT2RM4002352 

NT2RP2000903 

NT2RP2004205 

NT2RP2005883 

NT2RP3000921 

NT2RP3003532 

NT2RP4000108 

OVARC1 000873 



OVARC1001991, PLACE1000033, PLACE1000740, PLACE1001016, 



PLACE1001123, 
PLACE1 002374, 
PLACE1 003839, 
PLACE1 005003, 
PLACE1 007845, 
PLACE1010968, 
THYRO1000196 



PLACE1001231, 
PLACE1 002905, 
PLACE 1004282, 
PLACE 1005426, 
PLACE1008716, 
PLACE1011116, 
THYROI 000584, 



PLACE1001464, 
PLACE1002911, 
PLACE1 004441, 
PLACE1 006071, 
PLACE1 008744, 
PLACE3000181, 



PLACE1001655, 
PLACE1 003573, 
PLACE 1004450, 
PLACE1 006073, 
PLACE 1008985, 
PLACE3000213, 
THYRO1001266, 



HEMBA1 000745 
HEMBA1002178 
HEMBA1 004391 
HEMBA1 006707 
HEMBB1002120 
MAMMA1001043 



NT2RM4000284 
NT2RP 10006 13 
NT2RP2001480 
NT2RP2004606 
NT2RP2006004 
NT2RP3001650 
NT2RP3004130 
NT2RP4001467 
OVARC1001336 

PLACE1001836 
PLACE1 003737 
PLACE1 004520 
PLACE1 006290 
PLACE1010251 
PLACE4000354, 
Y79AA1 000270 



THYROI 000956, 
Y79AA1001727, Y79AA1 001 795, Y79AA1 002022, Y79AA1002213, 

[0252] The following 55 clones were categorized into signal transduction-associated proteins. 



HEMBA1 000006, 
HEMBA1 006658, 
NT2RM2001902, 
NT2RP2003179, 
NT2RP3000201, 
NT2RP4000634, 
OVARC1 001569, 
PLACE1 008282, 
THYRO1001593. 



HEMBA1 0021 95, HEMBA1 002227, 
HEMBA1006916, MAMMA1 000881 , 
NT2RP1 001 020, NT2RP1 001 031 , 



NT2RP2003545, 
NT2RP3000820, 
NT2RP4000962, 

PLACE1 002329, 
PLACE1 008297, 

THYRO1001700, 



NT2RP2004670, 
NT2RP3003527, 
OVARC 1000255, 
PLACE 10031 35, 
PLACE1010081, 
THYRO1001770, 



HEMBA1 002551, 
MAMMA1001150, 
NT2RP2001469, 
NT2RP3000011, 
NT2RP3003849, 
OVARC1 000529, 
PLACE 1003598, 
PLACE1011364, 
Y79AA1 000777, 



HEMBA1 005084, 
MAMMA1001310, 
NT2RP2001529, 
NT2RP3000022, 
NT2RP3003874, 
OVARC1000916, 
PLACE1005519, 
PLACE1011824, 
Y79AA1 000967, 



THYR01 001456, 
THYRO1001725, 
Y79AA1 000876, 
Y79AA1001328, 
Y79AA1001795, 
Y79AA1002213, 



HEMBA1 000835, 
HEMBA1 002421, 
HEMBA1 004444, 
HEMBA1 006749, 
HEMBB1 002245, 
MAMMA1001237, 
NT2RM2001792, 
NT2RM4000417, 
NT2RP1 000981, 
NT2RP2001755, 
NT2RP2005027, 
NT2RP2006042, 
NT2RP3002160, 
NT2RP3004133, 
NT2RP4002750, 
OVARC1001607, 



PLACE1 
PLACE1 
PLACE1 
PLACE1 
PLACE1 
THYR01 
Y79AA1 



002355, 
003772, 
004648, 
007081 , 
010784, 
000036, 
000426, 



Y79AA1 002381, 

[0253] The following 80 clones were categorized into transcription -associated proteins. 

HEMBA1 001 297, HEMBA1 001 390, HEMBA1 001 886, 
HEMBA1 004785, 
HEMBB1000106, 
MAMMA1001532, 
NT2RP 1000271, 
NT2RP2001538, 
NT2RP2003138, 
NT2RP2005391 , 
NT2RP3000232, 
NT2RP3004294, 
PLACE 1000442, 



HEMBA1 000462, 

HEMBA1003120, 

HEMBA1 006572, 

HEMBB1 002661, 

NT2RM2000773, 

NT2RP2000712, 

NT2RP2002564, 

NT2RP2004847, 

NT2RP2006092, 

NT2RP3004090, 

OVARC1001807, 



HEMBA1 000671, 
HEMBA1 003497, 
HEM BA1 007226, 
MAMMA1001094, 
NT2RM4000326, 
NT2RP2000739, 
NT2RP2002974, 
NT2RP2005247, 
NT2RP3000148, 
NT2RP3004119, 
PLACE1 000258, 



HEMBA1 005230, 
HEMBB1 000905, 
MAMMA1001615, 
NT2RP1 000468, 
NT2RP2001662, 
NT2RP2003302, 
NT2RP2005535, 
NT2RP3000378, 
OVARC1001049, 
PLACE 1000907, 



H EM BA1 005246, 
HEMBB1001959, 
NT2RM1 000789, 
NT2RP2000092, 
NT2RP2001817, 
NT2RP2003940, 
NT2RP2005774, 
NT2RP3000652, 
OVARC1001086, 
PLACE1 003529, 



HEMBA1 005929, 

MAMMA1002142, 
NT2RP2001769, 
NT2RP3000172, 
NT2RP3004067, 

OVARC1001338, 
PLACE1 006208, 

THYRO1001457, 
Y79AA1 002376, 



HEMBA1 002048, 
HEMBA1 006276, 
HEMBB1 002051, 
NT2RM2000632, 
NT2RP2000610, 
NT2RP2001948, 
NT2RP2004108, 
NT2RP2005941, 
NT2RP3001976, 
OVARC1001132, 
PLACE1004166, 
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PLACE1004168, PLACE1004887, PLACE1005250, PLACE1005682, PLACE1006079, PLACE 1008549, 
PLACE1011407, PLACE1011978, THYR01 000580, Y79AA1 000030, Y79AA1001090, Y79AA1001 523, 
Y79AA1 002334, Y79AA1 002378, 

[0254] The following 85 clones were categorized into disease-associated proteins. 

5 BNGH41000020, HEMBA1000349, HEMBA1000590, HEMBA1000671 , HEMBA1000835 : HEMBA1001184, 
HEMBA1001228, HEMBA1001886, H EM BA 1003120, HEMBA1004250, HEMBA1005246, HEMBA1005267, 
HEMBA1006707, HEMBA1006749, HEMBA1006902, HEMBA1006916, HEMBA1007013, HEMBB1002120, 
MAMMA1 000204, MAMMA1 002080, NT2RM2000632, NT2RM2001126, NT2RM2001558, NT2RP1 000271 , 
NT2RP1 000465, NT2RP1 000579, NT2RP2000447, NT2RP2000514, NT2RP2000739, NT2RP2001223, 

10 NT2RP2001529, NT2RP2001562, NT2RP2002674, NT2RP2003369, NT2RP2004108, NT2RP2004205, 
NT2RP2005535, NT2RP2005941 , NT2RP2006004, NT2RP3000059, NT2RP3000125, NT2RP3000201 , 
NT2RP3000232, NT2RP300061 6, NT2RP3000677, NT2RP3000838, NT2RP3000921 , NT2RP3001542, 
NT2RP3002286, NT2RP3002721 , NT2RP3002737, NT2RP3002738, NT2RP3004481, OVARC1000208, 
OVARC1000275, OVARC1000331 , OVARC1000410, OVARC1001086, OVARC1001132, OVARC1001 607, 

15 OVARC1001725, OVARC1001952, PLACE1000258, PLACE1 000442, PLACE1000907, PLACE1001100, 
PLACE1001500, PLACE1002905, PLACE1 002967, PLACE 1003407, PLACE1003428, PLACE1005005, 
PLACE1005239, PLACE1005815, PLACE1007028, PLACE1008716, PLACE1011407, PLACE1011978, 
PLACE2000118, THYRO1000580, THYRO1000866, THYRO1001071 , THYRO1001478, Y79AA1001062, 
Y79AA1001530, 

20 [0255] Out of them, the following 67 clones are those which matched the data of Swiss-Prot database and GenBank 
or UniGene database, genes or proteins which had been deposited in the database of Online Mendelian Inheritance 
in Man (OMIM) (http://www.ncbi.nim.nih.gov/Omim/), which is a database of human genes and diseases. (The corre- 
sponding OMIM numbers are indicated after the clone names.) 

HEMBA1 000349(*600046), HEMBA1 000590(*603897), HEMBA1 000671 (*602277) ; HEMBA1 001 886(*603899), 

25 HEMBA1003120(*602277), HEMBA1004250(*600976), HEMBA1005246(*602291) : HEMBA1005267(*1 06410), 
HEMBA1006707(*603897), HEMBA1006749(*603897), HEMBA1006902(*603897) : HEMBA1 00691 6(*601 524), 
HEMBA1007013(*603730), HEMBB1002120(*603367), MAMMA1002080(*602672), NT2RM2001126(*603785), 
NT2RM2001558(*604689), NT2R P1 000271 (*603899), NT2RP1000465f602231), NT2RP2000447(* 602580), 
NT2RP2000514(*602431), NT2RP2000739(* 194558), NT2RP2001223(*603558), NT2RP2001529(*603289), 

30 NT2RP2001562(*603371), NT2RP2002674(* 132811), NT2RP2003369(*1 79555), NT2RP20041 08(*601260), 
NT2RP2004205(*601 61 0), NT2RP2005535(*603899), NT2RP2006004(*600245), NT2RP3000059(*1 0641 0), 
NT2RP3000125(*1 80202), NT2RP3000201 (*604666), NT2RP3000232(*602277), NT2RP300061 6(*600245), 
NT2RP3000677(*1 42765), NT2RP3000838(M 90370), NT2RP3001 542(*1 91 1 61 ) s NT2RP3002286(*604331 ), 
NT2RP3002721 (*118950) 5 NT2RP3002738(*602265), NT2RP3004481 (*601610), OVARC1000208(*603603), 

35 OVARC1 000275(*1 25647), OVARC1 000331 (*1 39265), OVARC1 00041 0(*603874), OVARC 1 00 1 086(*603862) , 
OVARC1 001 725(*603046), OVARC1 001 952(*1 90370), PLACE1 000258(*603971 ), PLACE1 000442(*601 260), 
PLACE1 000907(*1 94558), PLACE1 001 500(*603781 ), PLACE 1 002905(*1 25950), PLACE1 003428(*603570), 
PLACE1005005(*603124), PLACE1005239(*603365), PLACE1 007028(*6021 31 ), PLACE 1 01 1407(*602277), 
PLACE1 01 1 978(*603971 ), PLACE20001 1 8(*301 000), THYR01 000580(*602277), THYROI 000866(*604045), 

40 THYRO1001071(*603533) } Y79AA1 001 062(*1 91161), Y79AA1001530(*602662), 

[0256] Out of 21 5 clones excluding the above-mentioned clones, HEMBB1 000668 and NT2RM4001 377 presumably 
belong to a group of signal transduction-associated proteins, based on the results of domain search by Pfam. 
[0257] HEMBB1 002302 presumably belong to a group of transcription-associated proteins, based on the results of 
domain search by Pfam. 

45 [0258] In the 437 clones categorized into secretory and/or transmembrane proteins on the basis of their full-length 
sequences, 41 0 clones were also predicted to encode proteins having functions of secretory and/or membrane proteins 
on the basis of their partial nucleotide sequences (5' sequences). In the 146 clones categorized into glycoprotein- 
associated proteins on the basis of their fulMength sequences, 124 clones were also predicted to encode proteins 
having functions of glycoprotein-associated proteins on the basis of their partial nucleotide sequences. In the 57 clones 

50 categorized into signal transduction-associated proteins on the basis of their fulMength sequences, 46 clones were 
also predicted to encode proteins having functions of signal transduction-associated proteins on the basis of their 
partial nucleotide sequences. In the 81 clones categorized into transcription-associated proteins on the basis of their 
full-length sequences, 57 clones were also predicted to encode proteins having functions of transcription-associated 
proteins on the basis of their partial nucleotide sequences. In the 85 clones categorized into disease-associated proteins 

55 on the basis of their full-length sequences, 6 clones were also predicted to encode proteins having functions of disease- 
associated proteins on the basis of their partial nucleotide sequences. The number of clones which were predicted to 
encode disease-associated proteins based on the full-length nucleotide sequences is much greater than that predicted 
based on the partial sequences. The reason is that the full-length sequences were categorized by using the data found 
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in the OMIM database into the category of disease-associated proteins. 

[0259] When the predicted functions based on the partial sequences were different from those based on the full- 
length sequences, several reasons were presumed; the ORF is too short in the partial sequence as compared with 
that of the full-length sequence; alternatively, P value for the partial sequence was greater than that for the full-length, 
5 that is, as compared with the probability of occurrence of the predicted function found in the full-length sequence, the 
probability was lower in the partial sequence. A protein does not always belong solely to a single category of the above- 
described functional categories, and therefore, additional functions can be found for the cDNA clones by further anal- 
yses. 

[0260] It is unclear, by the analyses for the full-length sequences so far, whether or not the remaining 212 clones 
io encode proteins belonging to any of the categories of secretory and/or membrane proteins, glycoprotein-associated 
proteins, signal transduction-associated proteins, transcription -associated proteins or disease-associated proteins. 
Nonetheless, the functions which were predicted based on the partial sequences can be verified by further analyses. 
[0261] Among the 212 clones, there are 38 clones that presumably belong to the category of enzymes and/or me- 
tabolism-associated proteins, cell division- and/or cell proliferation-associated proteins, cytoskeleton-associated pro- 
's teins, nuclear proteins, DNA-and/or RNA-binding proteins. ASP- and/or GTP-binding proteins, protein synthesis- and/ 
or protein transport-associated proteins, or cellular defense-associated proteins. The clones containing results of ho- 
mology search of Swiss-Prot database were categorized by considering the keywords and mentioned items in the 
matching data. The clones containing results of homology search of GenBank or UniGene database were categorized 
by considering the definitions and mentioned items in the matching data. 
20 When the matching data contained keywords such as "metabolism", "oxidoreductase" and "E.C. No. (Enzyme com- 
mission number)", the clones were herein defined as clones presumably belonging to the category of enzymes and/ 
or metabolism-associated proteins. When the matching data contained keywords such as "cell division", "cell cycle", 
"mitosis", "chromosomal protein", "cell growth" and "apoptosis", the clones were herein defined as clones presumably 
belonging to the category of cell division- or cell proliferation-associated proteins. When the matching data contained 
25 keywords such as "structural protein", "cytoskeleton", "actin-binding" and "microtubules", the clones were herein de- 
fined as clones presumably belonging to the category of cytoskeleton-associated proteins. When the matching data 
contained keywords such as "nuclear protein", the clones were herein defined as clones presumably belonging to the 
category of nuclear proteins. When the matching data contained keywords such as "DNA-binding" and "RNA-binding", 
the clones were herein defined as clones presumably belonging to the category of DNA- or RNA-binding proteins. 
30 When the matching data contained keywords such as "ATP-binding" and "GTP-binding", the clones were herein defined 
as clones presumably belonging to the category of ATP- and/or GTP-binding proteins. When the matching data con- 
tained keywords such as "translation regulation", "protein biosynthesis", "amino-acid biosynthesis", "ribosomal protein", 
"protein transport" and "signal recognition particle", the clones were herein defined as clones presumably belonging 
to the category of protein synthesis- and/or protein transport-associated proteins. When the matching data contained 
35 keywords such as "heat shock", "DNA repair" and "DNA damage", the clones were herein defined as clones presumably 
belonging to the category of cellular defense-associated proteins. 

[0262] The following 1 0 clones presumably belong to enzymes and/or metabolism-associated proteins. 
HEMBA1003315, HEMBB 1002465, MAMMA1000614, NT2RP2000178, NT2RP2001388, NT2RP2001903, 
NT2RP2002304, NT2RP2005878, NT2RP3001685, PLACE1006219 
40 [0263] The following 4 clones presumably belong to cell division-associated and/or cell proliferation-associated pro- 
teins. 

MAMMA1 000403, NT2RM2000497, NT2RP2000394, Y79AA1002121 
[0264] The following 6 clones presumably belong to cytoskeleton-associated proteins. 
MAMMA1001609, NT2RM 2000589, NT2RP3000063, PLACE1 004078, PLACE 1004492, PLACE 1008657 
45 [0265] The following 7 clones presumably belong to nuclear proteins. 

HEMBA1001878, HE MBA 1002992, MAMMA1000614, NT2RM4000965, NT2RM2001 738, NT2RP2001388, 
Y79AA1002121 

[0266] The following 5 clones presumably belong to DNA- and/or RNA-binding proteins. 
HEMBA1003072, HEMBA1006770, HEMBA1 007332, NT2RM2000497, Y79AA1002121 
50 [0267] The following 7 clones presumably belong to ATP- and/or GTP-binding proteins. 

HEMBA1002316, MAMMA1001609, NT2RM2000306, NT2RM2000497, NT2RP2000178, NT2RP3003729, 
PLACE1 004305 

[0268] The following 7 clones presumably belong to protein synthesis- and/or protein transport-associated proteins. 
NT2RM4000965, NT2RP2005069, NT2RP3000481 , NT2RP3000789, NT2RP4001877, OVARC1001833, 
55 OVARC 1002058, 

[0269] The following clone presumably belongs to cellular defense-associated proteins. 
PLACE 1005539 

[0270] Although it is unclear whether or not 26 out of 174 clones other than the above-mentioned clones belong to 
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any of the above-described categories, these clones are predicted to have some functions, based on the homology 
search using their full-length sequences. The clone names and the gene definitions found in the result of homology 
search are shown below, separated by a double-slash// 

HEMBA1000634//Homo sapiens T-cell activation protein (PGR1) gene, complete cds. 
HEMBA1002524//Human MHC Class I region proline rich protein mRNA, complete cds. 
HEMBA1003399//MVP1 PROTEIN, 

HEMBA1 005489//Mus musculus semaphorin cytoplasmic domain-associated protein 3A (Semcap3) mRNA, com- 
plete cds. 

HEMBB1000542//Mus musculus bromodamain-containing protein BP75 mRNA, complete cds. 
MAMMA1000788//Bos taurus P14 (p14) mRNA, complete cds. 
MAMMA1002128//ABC1 PROTEIN HOMOLOG PRECURSOR. 
NT2RM2000514//Homo sapiens F-box protein Fbx21 (FBX21) mRNA. complete cds. 
NT2RM2000622//Mus musculus F-box protein FBL10 mRNA, partial cds. 
NT2RM4000100//Homo Sapiens Leman coiled-coil protein (LCCP) mRNA, complete cds. 
NT2RP2005425//Homo sapiens mRNA for AKAP450 protein. 

NT2RP3001170//Mus musculus activity-dependent neuroprotective protein (Adnp) mRNA, complete cds. 
NT2RP3002571//Bos taurus mRNA for lyncein. 

NT2RP3004557//Human Ki nuclear autoantigen mRNA, complete cds. 
OVARC1001596//Homo sapiens Arf-like 2 binding protein BART1 mRNA, complete cds. 
PLACE1002153//Homo sapiens TACC2 protein (TACC2) mRNA, partial cds. 
PLACE1003163//Homo sapiens DBI-related protein mRNA, complete cds. 
PLACE1005736//Human mRNA for BAS-GRIP protein. 
PLACE 1007702//Mus musculus TRA1 mRNA, complete cds. 
PLACE1011045//Homo sapiens EMike protein mRNA, complete cds. 
THYRO1000061//Mus musculus mRNA for UBE-1c1, UBE-1c2, UBE-1c3, complete cds. 
THYRO1000964//Drosophila melanogaster Felle associated protein Pellino (Pli) mRNA, complete cds. 
Y79AA1 000776//Mus musculus mRNA for GSG1 , complete cds. 
Y79AA1001056//Homo sapiens MAID protein mRNA, complete cds. 

Y79AA1001272//Homo sapiens retinoic acid repressible protein (RARG-1) mRNA, complete cds. 
Y79AA1 001 793//Mus musculus mRNA for GSG1 , complete cds. 

[0271 J So far, useful information for presuming the functions are unavailable for the remaining 148 clones, of which 
names are listed below. 



HEMBA1 000275, 
HEMBA1 001296, 
HEMBA1 003487, 
HEMBA1005145, 
HEMBA1006912, 
HEMBB1 000642, 



HEMBA1 000300, 
HEMBA1001563, 
HEMBA1 004007, 
HEMBA1 005430, 
HEMBA1 007057, 
HEMBB1001200. 



HEMBA1 000443, HEMBA1 000875, 

HEMBA1002164, HEMEA 1002239, 

HEMBA1 004067, HEMBA1004085, 

HEMBA1 00591 3, HEMBA1 00601 6, 

HEMBA1 007063, HEMBA1 007291 , 

HEMBB1 001 547, HEMBB 1002039, 



MAMMA1 000046, MAMMA1 000449, MAM MA 1000528, MAMMA1 000652, 
MAMMA1001066, MAMMA1001284, MAMMA1001623, MAMMA1001634, 
MAMMA1 002205, MAMMA1 002224, NT2RM2000582, NT2RM2001 643, 



NT2RM4001321, 
NT2RP2000240, 
NT2RP2003599, 
NT2RP2005514, 
NT2RP3000444, 
NT2RP3002281 , 
NT2RP3002887, 



NT2RP 1000002, 
NT2RP2001878, 
NT2RP2003931, 
NT2RP2005632, 
NT2RP3000645, 
NT2RP3002324, 
NT2RP3002983, 



NT2RP1 000239, 
NT2RP2001921, 
NT2RP2004069, 
NT2RP2005887, 
NT2RP3000871, 
NT2RP3002353, 
NT2RP3003448, 



NT2RP1 000679, 
NT2RP2002015, 
NT2RP2004141, 
NT2RP2006099, 
NT2RP3001044, 
NT2RP3002409, 
NT2RP3003469, 



NT2RP3003963, NT2RP3004000, NT2RP3004202, NT2RP3004321, 



NT2RP3004355, 
OVARC1 000775, 
OVARC1002178, 
PLACE1 003460, 
PLACE1 005669, 
PLACE 1007591, 



NT2RP3004374, 
OVARC1 000853, 
PLACE1 000986, 
PLACE 1003644, 
PLACE1 005768, 
PLACE 1007971, 



NT2RP4002715, 
OVARC 1000995, 
PLACE1001114, 
PLACE 1004028, 
PLACE1006515, 
PLACE1 008984, 



OVARC1 000090, 
OVARC1001222, 
PLACE1001229, 
PLACE1004199, 
PLACE1 006786, 
PLACE1 009735, 



THYR01 000846, THYR01 000999, THYROI 001 063, THYR01 001 1 28, 



HEMBA1 000907, 
HEMBA1 002985, 
HEMBA1 004952, 
HEMBA1006517, 
HEMBB 1000276, 
HEMBB1 002228, 
MAMMA1 000706, 
MAMMA1001901, 
NT2RM4000115, 
NT2RP1 000740, 
NT2RP2002409, 
NT2RP2004447, 
NT2RP2006134, 
NT2RP3001061, 
NT2RP3002448, 
NT2RP3003473, 

OVARC1000137, 
OVARC1001260, 
PLACE1001788, 
PLACE1004519, 
PLACE1 007040, 
PLACE2000219, 
THYRO1001471, 



HEMBA1001272, 
HEMBA1 003294, 
HEMBA1 004971, 
HEMBA1 006544, 
HEMBB1 000309, 
HEMBB1 002663, 
MAMMA1000814, 
MAMMA1 002087, 
NT2RM4000295, 
NT2RP1 000903, 
NT2RP2002510, 
NT2RP2004837, 
NT2RP3000427, 
NT2RP3001754, 
NT2RP3002664, 
NT2RP3003559, 

OVARC1 000467, 
OVARC1001727, 
PLACE1 003438, 
PLACE1 005601, 
PLACE1 007077, 
PLACE4000455, 
THYRO1001495, 
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THYRO1001608, THYRO1001803, Y79AA1000127, Y79AA1 000750, Y79AA1001592, Y79AA1001863, 

EXAMPLE 18 

5 Expression frequency analysis using PCR 

[0272] Many genes acting at the downstream of TNF-. and IL-1. among inflammation-associated cytokines have 
been previously identified. The respective stimulations are transduced through independent pathways of signaling 
cascade. There exists another signaling cascade for both stimulations, wherein NF-.B is a common transducing mol- 

10 ecule shared by the two stimulations (J. Leukoc. Biol., 1994, 56(5): 542-547). It has also been revealed that many 
inflammation-associated genes, including IL-2, IL-6 and G-CSF, are varied in their expression levels in response to 
the signal through the common pathway (Trends Genet. 1999, 15(6): 229-235). A survey was performed by using 
ATAC-PCR method (adaptor-competitive PCR method: Nucleic Acids Res. 1997, Nov 15; 25(22): 4694-6) for genes 
of which expression levels were varied depending on stimulation of inflammatory cytokines, TNF-. and IL-1 It is pos- 

15 sible that genes of which expression is varied in response to this stimulation also participate in inflammation. 

[0273] Jurkat cells (Dainippon Pharmaceutical Co., Ltd.: catalog No. 06-152) were cultured in a PRMI1640 medium 
(Nikken Biological and Medical Institute: catalog No. 14-501 F) containing 10% fetal calf serum until the cell count 
reached to 7 cells. The cells were transferred into a fresh medium containing 10 ng/ml TNF-. (recombinant Tumor 
Necrosis Factor; Wako pure chemical Industries Inc.: catalog No. 201-13461) or IL-1. (recombinant lnterleukin-1.; 

20 PeprotechEC: catalog No. 200-01 B) and, further, cultured at 37. under an atmosphere of 5% C0 2 . The cells cultured 
in the presence of TNF-. were harvested 1 , 3 and 7 hours after addition of TNF-.. The cells cultured in the presence 
of IL-1 . were harvested 1 and 7 hours after addition of IL-1 .. Total RNA was extracted from each of the cells by AG PC 
method (Acid-Guanidinium-Phenol-Chloroform method: Ana Biochem. 1987, Apr; 1 62(1 ):1 56-9). Total RNA was also 
extracted form the cells in the absence of any stimulation of TNF-. and IL-1 

25 [0274] ATAC-PCR analysis is performed basically according to the same procedure as described in "DNA Microarray 
and Advanced PCR Methods" (Cell Engineering, p. 104-112, (additional volume, Genome Science Series 1), Muram- 
atsu & Naba (eds.), Shujunnsya). Adaptor ligation reaction was performed for an internal standard sample (which was 
used for preparing a calibration curve for the assessment of the test samples) and test samples in the following inde- 
pendent two reaction systems. Combinations of each type of the 6 adaptors (AD-1 , AD-2, AD-3, AD-4, AD-5, and AD- 

30 6: see the sequences shown below) with each sample are as follows: 

Reaction system A 

AD1 : internal standard sample (x10 concentration) 
35 AD2: sample before stimulation 

AD3: internal standard sample (x3 concentration) 
AD4: sample with IL-1 stimulation for 1 hour 
ADS: sample with IL-1 stimulation for 7 hours 
AD6: internal standard sample (x1 concentration) 

40 

Reaction system B 

AD1 : internal standard sample (x1 concentration) 
AD2: sample with TNF stimulation for 1 hour 
45 AD3: sample with TNF stimulation for 3 hours 

AD4: internal standard sample (x3 concentration) 
AD5: sample with TNF stimulation for 7 hours 
AD6: internal standard sample (x10 concentration) 

50 

Adaptor sequence 
A01; 

SEO ID N0:41 80//5' - GTACATATTGTC GTTAGAACGCG - 3 ' 

55 
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SEO 10 NG;4 1 8 1 //3* -CATGTATAACAGCAATCTTGCGCCTAG-5' 
AD2; 

SEQ ID N0:4 1 82/ / 5 f -GTACATATTGTCGTTAGAACGCGACT -3' 

SEQ 10 ^IO:4183//3 , -CATGTATAACAGCAATCTT{;CGCTGACTAG-5 , 
A03; 

SEO ID N0:4184//5 > -GTACATATTGTCGTTAGAACGCGCATACT-3' 

SEO ID N0:4 185//3-CATGTATAACAGCAATCTTGCGCGTATCACTAG-5' 

AD4; 

SEO ID N0:4 1 86//5'-GTACATATTGTCGTTAGAACGCGATCCATACT-3' 
SEO ID N0:4 1 87/ / 3* ~CATGTATAACAGCAATCTTGCGCTAGGTATGACTAG-5' 

AD5; 

SEO ID N0:4188//5 , -GTACATATTGTCGTTAGAACGCGTCAATCCATACT~3' 

SEQ ID N0:4189//3 , -CATGTATAACAGCAATCTTGCGCAGTTAGGTATGACTAG-5' 
AD5; 

SEO ID N0:4 1 90//5'-GTACATATTGTCGnAGAACGCGTACTCAATCCATACT-3' 
SEO ID N0:4 1 9 1 // 3 T ~CATGTATAA.CAGCAATCTTGCGCATGAGTTAGGTATGACTAG-5' 

[0275] In this assay, the internal standard samples used were total RNA from cultured cells or human tissues from 
which the cDNA libraries originated. The cultured cells and the total RNAs from tissues are indicated below. Culture 
of the cells was performed according to the method as described in the supplier's instruction manual RNA preparation 
was carried out by standard methods. 

Human teratocarcinoma cell NT-2 (Stratagene, catalog No. 204101) 

Human neuroblastoma cell SK-N-MC (Dainippon Pharmaceutical Co., Ltd., catalog No. 04-010) 
Human neuroblastoma cell Y79 (Dainippon Pharmaceutical Co., Ltd. ; catalog No. 04-018) 
Human placenta tissues total RNA (BioChin, catalog No. 064008) 
Human breast tissue total RNA (Clontech, catalog No. 64037-1) 

[0276] PCR primers used for amplification of specific genes, and names of the corresponding cDNA clones are 
shown below. The assay was not carried out for clones of which corresponding internal standard sample could not be 
prepared for the assay. The gene-specific primers were designed so that the PCR products derived from the cDNAs 
with adaptor were 70-200 bp in size. Sequence of the adaptor-specific primer (labeled with fluorescent dye (FAM)) 
used for the competitive PCR was GTACATATTGTCGTTAGAACGC (22 nucleotides, SEQ ID NO: 4192) PCR was 
performed basically at 94. for 5 minutes; and at 94. for 30 seconds, at 50. for 60 seconds, and at 72. for 60 seconds 
for 30 cycles. The annealing temperature was, however, changed in some PCR experiments. 
[0277] Nucleotide sequence of clone-specific primer (all the primers consist of 20 nucleotides) used in this experiment 
[0278] Clone names, primer sequences, and SEQ ID NOs were shown in this order, separated with a double-slash 
mark, // 
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MAM MA 1 00004 6//GTTACATCCAAGCATAC AGG//SEQ ID NO:4193 
MAMMA 1000 1 02//ACGGGGTCTCA7TCTGACAC//SEQ ID N0:4194 
MAMMA1 0001 4 1 // CAAGGTAACACGAGTCTATC//SEO ID N0:4195 
MAMMA1000226//ACTGAGGGGCAAAGGAGAGA//SEQ ID NO:4196 
MAMMA 1 000403/ /ATTTTCCTGGAGAGCCGACT //SEO ID N0.4J97 
MAMMAI 000473/ /TGGAAAGTGTACCGGAATTG//SEO ID N0:4198 
MAMMAI 000496/ /TCAATCTGGCGCTCTGTCAC//SEQ ID N0:4199 
MAM MA 1 0006 1 4/ / AGTTCTTTACA T GCTGAGGT/ /SEQ ID N0.4200 
MAMMAI 000652//TGGTGGAAGACTGGGTTTGC//SEO ID N0:4201 
MAMMA1000706//ATGGTCTTGTGGTGCCAGGT//SEQ ID NO:4202 
MAMMAI 000788/ /TGTCCAAAAGCCACACAGAG/ /SEQ ID N0:4203 
MAMMA 1 0008 10// ATACTTCCCGCACCCCAAAA//SEQ ID N0:4204 
MAMMAI000814// CAGGGTTTCTGCATGTTGGC //SEQ ID N0:4205 
MAMMAI 00088 1//ATGGAGTTTCACTCTTGTTG//SEQ ID N0:4206 
MAMMAI 000986/ /TGCTGCTTCTTTACATGGGA//SEQ ID N0:4207 
MAMMAI 000994/ / CAGGATAGAGGTTGCAGGCT//SEQ ID N0:4208 
MAM MA1 00 1 066//GATGGGGTCTCACTCTGTCA//StQ ID N0:4209 
MAMMAI 00 1094//ACGTCCAGAAACTACAGGGT//SEQ ID NO:4210 
MAMMAI 001 1 41//ACTGTACTTAGGATGCTTCA//SE0 10 N0:4211 
MAMMAI 001 237//GGGCAACCCTATGTAGATGA/ /SEQ ID N0:4212 
MAMMAI 00 1 284//GTCTGCTCTGTTACATAGGG//SEQ ID N0:4213 
MAMMAI 001 3 10//ACGCCTGTAATCCCAACCCA//SEQ ID N0:4214 
MAMMA1 00 1 344//GCCAGTTTGTTCTAGGATGC/ /SEQ ID NO:4215 
MAMMA1 001 532//ACATCTATAAGGCTGTTTGC//SEQ ID N0:4216 
MAMMA1001609//GGGTC1CACTCTGTTACCCA//SEQ ID NO:4217 
MAMMA i 00 1 6 1 5// CAAGGGACACTGAGAACTGG/ /SEQ ID N0:4218 
MAMMA 1 00 1 623/ / GGATTGATGCCCGATACTTA//SEQ ID N0.4219 
MAMMA1001 901 //GATAGGGTCTCATTCTGTTA//SEQ ID N0:4220 
MAMMA 1 00 1 95 7/ /TAGTAGAGACGGGGT ITCAC //SEQ ID N0:4221 
MAMMAI 00 1978//CTCCCTCAGACGCTTTATTG//SEQ ID N0:4222 
MAM MA 1 002 070// GAAGAGAAACTGGGGCATCC//SEQ ID N0:4223 
MAMMAI 002080//ACTCTCCCTCACTACCACTG//SEQ ID N0:4224 
MAM MA 1 002 08 7//AGCTCGTCTCATGG GCAACT/ /SEQ ID N0:4225 
MAMMAI 00209 1//CTGACGTAGGTGAGGTCCAT/ /SEQ ID N0:4226 
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MAMMAI002C95//CATGTTTAIGGTCAGGTAGT//SEQ ID MO:4227 
^.MMA1002I28/AGGAGCAGATGAGG.AAGGAG//SEO ID K0:4225 
MAMMAI002142//TCTC7TAAACAICCAACACG//SEQ ID NO:4229 
MAMMAI002165//CTCAAACACGCACGCTCAAC//SEQ ID N0.4230 
MAM MAI C02234//TnGTCTTCCTCTAAACCAG//i;EO ID N0:423l 
MAMMA 1 002536/ /CT CCAC CGAAAAG AC CC ATT//S 1 0 ID N0:4232 
MAMMA1002633//CACAGCACCATCTCCAAGCA//SEQ ID N0:4233 
MAMMA 1 003 1 26//ACAGTATCAGAGGAGCAGGA//SE0 ID NC 4234 
NT2RMI0G0452//AGAGCCCAGGACATTTGAGG//SEQ ID N0:4235 
NT2RM1000542//C.AA.GGGCACGTnCAGCACT//SEO D N0:4236 
MT2RMI00073S//nCGCTTGCTCnCTCTGGA//!;EO ID N0:4237 
NT2RM 1 000655/ / AGCAAGCTCTCCAGAGTGTG/ /SEO ID N0:4233 

.NT2RMI000858//GAACAGGAAAATGCTCTCAG//SEO ID N0:4239 
NT2RM2O0O241//CTCCTTGCCTGCCTTGTAGT//SEQ ID N0:4240 
NT2RM2000305//TAGTCCCTT7CCTGATCGTC//SEQ 0 NO:4241 
NT2RM2000497//TTACTACAGACGGTGTTTCA//SEC ID N0:4242 
NI2RM20005 1 4/ /TGTTCTCTTTCTTTGCACTG/ /SEQ ID N0:4243 

NT2RM2000582//GGGAATAACATCTACAACCT//5EQ ID N0;4244 
NT2RM2000588//CCCCAGAAACACAGAAGGCT//SEG ID N0.4245 
NT2RM2C0C589//TAAGGCATCTGTCCTCTAAC//5EQ ID N0;4246 
NT2RM200C 622/ / ATCICGGCTATGAACTGCTC//SEQ ID N0:4247 
N72RM2000773//TTCTGCCCCTCCTCAMCCT//SEQ ID N0:4248 
NT2RM2001 126//GCAACAGCTTCTTCATGGGT//5EQ !D N0.4249 
NT2RM2G0 1 558/ /TCGCCCACACTGCATCCTTT//SEO ID N0:4250 
MT2RU2001 626//TCGGGTGGCAGTGTGTTGAA//SEQ ID N0:4251 
NT2RM2001 643/ / GTAGTTCTCTCTGAAGAATC/ /SEQ 10 N0:4252 
NT2RM200 1 738/ /CCCAGCACTTTTATTTGTAG//SEQ ID NO:4253 
NT2RM200 1 792/ / AGTGTAGTTTGGAGATGAGA//SEQ ID N0:4254 
NT2RM20Q 1 902/ /TCCCCATCCAGCCACAGAAA//SEQ ID N0:4255 
NT2RM2002 109//ATTGCCTATAGAAAGTCAGC/ /SEC -0 N0:4256 
NT2RM4000 1 00/ /GCATTTATAGGGCTCAAGAT//5EQ ID N0:4257 
NT2RM40001I5//TAGTTTCGACTTCTGGTTCA//SEQ ID N0:4258 
NT2RM4000284//ACATCCnCAAATCAGCATC//S:Q ID N0:4259 
NT2RM4000295//GCTCTCACCCTGCTCACAAA//SEQ ID N0;4260 
N'T2RM40004 1 7/ /TGCTGTGTAGTATTCCITAG//5EQ ID NO:4261 
NT2RM400076 1 / /CTAATACAATGCCAGTCAGG//SEQ 10 W:4262 
K'T2RM4001377//CTAGCITCCTCTCCCAAC"G//SEQ ID NC:4263 
NT2RM4 00 1 735/ /TCATCACGACTGCTGTAGAG//SEC 10 N0:4264 
NT2RM400I768//TIACTAGAGGTGGGGTTTCA//SEQ ID NO:4265 
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NT2RM400 1 843//CAAGGGCACGTTTCAGCACT//SE0 ID N0:4266 
NT2RP 1 000239//TTAGCAGTGTATCCTTTCCG//SEO 10 N0:4267 
MT2RP 1 000465//TTGCCCAGGCTAGTCTCGAA/ /SEQ ID N0:4268 
NT2RP 1 0004 68//TTGCCCAGGCTAGTCTCGAA// SEC ID NO: 4269 
NT2RP1000679//GGCCTCAGTTCCTTGCATn//SEO IC N0:4270 
NT2RP 1 000740//CGCTATATTTCATGGGCTTT//SEO 10 NO:4271 
NT2RP 1 00 1 03 1 //CATGnACCTATACACACCA//5EQ !D N0:4272 
NT2RP2000092// CAC ACAC ACATG AACTC ATT/ /SE Q ID N0;4273 
NT2RP20001 78//ATCTTTTGTGACAGCTCCAG/ / 5E0 IC N0:4274 
NT2RP2000240//CCATTCCACTCATACCAAGA//SEG ID N0:4275 
NT2RP2000447//aCTTGCCATGATAGGTCTT//5:0 IC N0:4276 
NT2 RP20005 1 0/ / GACATGAGACAAAATTAGCC //SEO ID NO:4277 
NT2RP20006 1 6//AAAATAACTGCCTGGGAGGT//5E0 ID NO:4278 
NT2RP20007;2//GAAGGTAGAGCTTGAGAGAG//SEQ D NO:4279 
KT2RP2000739//AGCAACAGGGAATGGAGGTG//SEO ID NG:42S0 
NT2R 3 200G81 8/ /GGGAAGAAATGAGACAAAGA/ /SEO ID N0:4281 
NT2RP2001 200//ACCACATAGCTGCAGGAAAG//SEO ID N0:4232 
NT2RP200 1 276/ /TTGCCGTTGGTGCTGGTAGT//5EQ ID N0:4283 
NT2RP2001 388//TGGCACAATCTCCGCTCACT/ /SEO ID NO:4284 
NT2RP200 1 469//TAATGGGTGGTGGGAGCTGA//SE0 10 N0:4285 
NT2RP2001 538//GAAAAGGGTAGGCAGCAAGG//SE0 ID N0:4236 
NT2RP2GQ1 562//CACCCTGCCACAAGACATTT/ /SEQ ID N0:4287 
NT2RP200 1 662/ /AAGAGGCTGGGCAAATGGCA//SEQ ID NO:4288 
NT2RP2001 755//CAAGCAAATAATCCAGCCAT//SE0 ID N0:4289 
NT2RP200 1 8 1 7//CTAAAGCAACAGAGGAATAC//SE0 ID N0:4290 
NT2RP200 1921 //TGTGGGGTTGGTCTTTGGAA/ /SEO ID N0:4291 
NT2RP2001948//GCATTGAGGACTnTCCAGA//SEQ ID MO:4292 
NT2RP20020 1 5//GTAGTTCTCTGTGAAGAATC//SI0 D N0:4293 
NT2RP2003 1 38//ATGGAACAGGC7CAGGCAAA//SEQ ID NO:4294 
NT2RP2003 1 94//CACCTCTCATGTTTCTCC AC//SEO ID N0:4295 
NT2RP2003302//AGAGGACAGTTGGAAGATTT//SEO ID N'0:4286 
NT2RP2003390//CTGTGGGTGATTrCTGGCA//5EO ID N0:4297 
NT2RP20C3593//CTGACCCCTAAGTAAATGAA//SEQ 10 N0:4298 
NT2RP2003564//GGGTGTGATGTT"ACTTCTC//SEO ID NO:4299 
NT2RP2003950//CTGAAGGAGG T ACfGACACT//SEQ ID N0:4300 
NT2RP2004069//CATCACCAAGGIAITAAGCC//SEQ D N0:4301 
NT2RP2004 1 03/ / GATAGGTTACTGGTG AATTG/ /SEO ID NO:4302 
N T2 RP2 0050 69//G A.AAATGAACCT G TAGCC TT //SEQ ID N0:4303 
NT2RP2G05378//ATTCTGGCTCCCTCTTCCTC//SE0 ID NO:4304 
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NT2RP200559 1//CAAGCTGGAOTACATGGG//SE0 ID NO.430: 
W2RP2005535//tfJinATCGCWTCTCTCn//SE0 10 N0:4306 
NT2RF2005597/ / GACCAAGACCCTTCAACCTC/ /SEQ 10 N0:4307 

NT2RP2006092/ACCTACGTTGTGGTGCAGTG//SE0 ID N0:43C8 
NT2RP2006I34//TGACTGCCMTTAGAAACCT//:5EO ID N0:4309 
NT2RP2006208/ / AAAATTGGCi TCTGCCTAGT//SEQ !D NO:43-0 
NT2RP20C6476//GTTCACCTrCACAJCCAAAA//SEQ ID N0.431 1 
NT2RP30000I I//TGGCACAATCICAGCTCACT//SEO !D N0:4312 
iN T 2RP3000031 / /TTCTGGGCTGGAGIAGTGGT //SEO 10 N0.43I3 
NT2RP3000063//AGATAGTCACATAGACAGAG//SEQ ID N0:4314 
NI2RP300OI25//ACACATCCAACCCITCACn//SE0 ID N0:43I5 
NT2RP300G 148// AACAAGTCCAGCCCCAGAAG//SEQ D N0:43 1 6 
NT2RF3000169//CCMGCACGCCATATGAAGC//SEQ ID W:4317 
NT2RP3000I7I//CAGAATTTTGCCACGAGGAT//SEQ ID N0:4318 
NI2RP3000172//GGCACAGACACCATCCTTGA//SEG ID N0:43?9 
NT2RP300020 1 / / AATGGGGTTTTGCCAGGTTG//SEO ID N0:4320 
NT2RP3000232//ACCT:CAIACAACTTTTCCC//:5EQ ID N0:4321 
NT2RP30003G4//TGGTCTGCCTCATCCTTCTC//SEO ID NO:4322 
NT2RP30Q0378//GA4AAGGG1AGGCAGCAAGG//SEQ ID NO:4323 
NT2RP300G427/ / ATGIAAAGIGT GGAGTACCC/ /SEQ ID N0.4J24 
NT2RP30C0444/ /TTCTTCTC AGTC ACCTCCAC //SEO ID NC:4325 
NT2RP30006I6//GTGATAGTAACACAATCCTG//SEQ ID N0:4326 
NI2SP3000645//TGGCCAGIGTATGAGAGGTC//SEQ ID NC:4327 
W2RP3000676//CMGAQWCAAAACAGAAGG//SEQ 0 N0:4328 
NT2RP3000677//TCCATTTTACCTGTAGACAC//SEQ ID N0.4329 
NT2RP3C9072 1 / / GAGACTGTGA7GCCTTGGTG//SEQ ID NC:4330 
NT2RP3000789//AACTGATGGCTTCTGTCTCC//SE0 ID N0:4331 
NT2RP30C08 1 8//GGGAGAAGCCTAGAAAAGAA/ /SEQ ID N0:4332 
NT2RP30GC820//TCACATTCAAGCICTACGTC//SE0 ID N0:4333 
NT2RP300G838//CTAGCTTCCTCTCCCAACTG//SEG ID N0:4334 
NT2RF3000871//GATTGTGCITGGAGCTTGGC//SE0 ID N0:4335 
NT2RP3000907//GGGAGATGAAGAGGAAGCAG//SE0 ID NC:4335 
NT2RP3000921//ACATGGGTAGCACTCCTTTT//SEQ ID N0:4337 
NT2RP3001012//ACCCATTTCTACCTCTCTTA//SEQ ID N0:4338 
NT2RP300 1 044/ /AGCAAGGCAATGAGGATACT//SEQ ID N0:4339 
NI2RP300: 1 59/'/ AAATGTAGGGGTGGGGAAGT//SEQ 10 NO:4340 
NI2RP300 1240// AGCAAGGTCTCCAGAGTGTG / /SEO ID NO:434i 
NT2RP3001271//GTGCTICTACCAITGACGAT//SEO ID NO:4342 
NI2RP3001542//TGCTCCICACTGCCnCAAA//SEO ID NO:4343 
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NT2RP300 1 560//CGCCTCCACAAACAAACCCT//SE0 ID N0:4344 
MT2RP30C1592/7TMAAGGGAGGATGT[CAGT//!)E0 D N0:4345 
NT2RP30G 1 650//GCATCTTCTGGTAGnGTCC//SEO ID N0:434B 
NT2RP300 1 685//GATTCTCCTTGCCTCAGCCT//3E0 ID NO:*347 
NT2RP3001 754//AGTTAGTGGTGCCTGCTTCC//5EQ ID N0:434S 
NT2RP300 1 976//CTCACTGGCATTTAGCTGGT//SEQ ID N0:4349 
NT2RP30020 1 5//CAACAACCTTCCTCCTACCC//SEQ ID N0:4350 
NT2RP3002281/ /TAAACAGTCAACCAATGCTC//SEQ ID NQ:4351 
NT2 RP3002286 / / AAGCAAGAGATTTGG AGGAA/ /5EQ ID NO:4352 
NT2RP3002353//GGCTGGAACTCAATTICTGC//5EQ ID N0:4353 
NT 2 RP300 2 4 09//GTAC CCCT AGTG AAGACCTG// SEQ ID N0:4354 
NT2RP300241I//TCAATAGCTACCTGTAGAGT//5EQ ID N0:4355 
NT2RP3002448//TCTTCCTACGACATAACCAT//SEO ID N0:4355 
NT2 RP3002 57 1 //GCCCAAAATCCAACAGTAAA//SEG ID N0:4357 
NT2RP300272 1 //AGTGGCTTGTATTCTGTGGA//SEQ ID N0:4358 
NT2RP3002737/ /GTGAGGTACTTGCATATCCG//SEQ ID N0:4359 
NT2RP3002738//TCCCCGATGAACACCAGCTT//SE0 ID N0:4360 
NT2RP3002790//CTGGAGCCTGAnATGAGAA//SEO ID N0:4361 
NT2 RP30 02 83 6// C ITT AG CAACATAACCTCCA/ / SEQ ID N0:4362 
NT2RP3002900//nTCTTCCTCCCTAACACAT//SEQ ID N0:4363 
NT2RP3002958//GCCTTCTTCCAGCTCTACAT//SEQ ID N0:4364 
NT2RP3002983//AATTTCCTTGCTAGAAGGCT//SEQ ID N0:4365 
NT2RP30G3354//GGCTGACACCTATTATCCCA//SEQ ID N0:4366 
NT2RP3003448//TGTAGTCCaGCTATTCAGG//SE0 ID N0:4367 
NT2RP3003473//AATTAATCTCTGGGTAGCAC//SEQ ID N0:4368 
NT2RP3003527//TGTCAATGGCCAGGTCGTAG//5EO ID N0:4369 
NT2RP20Q3532//TAMCTGACTTCCTCTTGGG//SEQ B N0:4370 
N12KP30Q3535//CTAATGCCAG7GTTTCAAGA//SEQ ID N0:4371 
MT2RP3003559/ /AATTCTTGTCCTGCTTTTGC//SEQ ID N0:O72 
NT2RP30036 1 4//GGGTCTTTGGATTGAGTGTC/ /SEQ ID N0 4373 
NT2RP30039S3//CACGATTATCTCTGCCAAAA/ySE0 ID N0:4374 
NT2RP30040OO//TGGCCAGTGTATGAGAGGTC//SEQ ID N0:4375 
NT2RP3004025//CACATTCTGGTGGAAAAGCA//SEQ ID N0:4376 
NT2RF3004067//TGATGACACGCAGCACTTCC//SEO ID NO:4377 
NT2RP3004075//TTTCAAGTCAACACCTGCAC//5EQ ID NO:4378 
NT2RP3004090//TACACTACAGATGGGCAAAA//SEO ID N0:43?9 
M2RP30Q4 1 1 9//TCTMGGCTGGGTACAGIGG//SEQ ID N0:4330 
NT2RP3O04 1 30// ACATTTCTCCTCCATACGCA//5EG ID N0:4381 
N 1 2RP3C04 1 33//TAACCGCACTATGAGGAAAG//SE0 ID NO:4332 
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NT2RP3004202//CAGIGGCCTGGTAATAGAGT//3EQ ID N0:4383 
NT2RP3C04294/ / ATATTCCACTCCCCATTCGG//3EQ 10 N0:4384 
NT2RP3004321//GACCAACATGCTAGAAC?GC//SEQ 10 N0:4335 
NT2RP3004345//AACTCTCATTCCATAAGGTG//StO ID N0:4386 
NT2RP3004406//CACCTAAMGACTAATCCCT//SEQ 0 NO:4387 
NT2RP3004552//GAATCCAAAAGCCGGTAGGG//SEQ ID N0:4388 
NT2R P3004 557/ /GAAAGAGGTCAMGTACCTG//SEQ ID NO:4389 
NT2RP3004625//ATACAGGCAGCAGGAATCAC//SEO ID N0:4390 
NT2R P3004 64 7//CMG ACAACAAAACAGMGG //SEQ ID N0:4391 
N T 2RP4000634//ATGAAGAGTTACCIATGTGG//SEQ ID NO:4392 
NT2RP4000952//AACCCTGGGCJTGGAATTCA//SEQ ID N0:4393 
NT2RP4C0'001//TTTCACAATGCTACAGAGGT//SE0 ID NO:4394 
NF2RP4001 009/ /AATGICAGCGGAGCAAAAGA//SEO ID NO:4395 
Nf 2RP4001 467//GTTGAGAATGCTGGACTTGA//5E0 ID N0:4396 
NT2RP400I877//AATCATA1AGTCCCAGGTTG//SE0 -D N0:4397 
NI2RP4001879//GTGTGAGAGTAGTTGGGGAA//SEQ ID W:4398 
NT2RP4002187//TCAATAGCTACCTGIAGAGT//SEQ ID N0:4399 
NT2RP400245 1 / / CATAAACAGTGACAGCAGAA//SEO ID N0:4400 
NT2RP400271 5/ /AGCAAGGCAATGAGGATACT//SEG !D N0:440i 
NT2RP4002750//CAGCAI7TAGGTGTGACGAT//SEQ ID N0:4402 
PLACE1000040//ACATTGCTTGAGTCTTGCCA//SEQ ID N0:4403 
PLACE 1000986//CMGGAGGTMTAGGGAGAT//3E0 ID N0:4404 
PLACE 1 002080// CTCACTCTGTCATCGAGGCT//SEQ ID N0:4405 
PLACE 1 0025 4 7// GTACCCCTAGTGAAGACCTG//5E0 ID N0:4406 
PLACE1 00291 1//CAAAGCCMCICITCACGCT//SEQ 10 N0.4407 
PLACE 1 003407// GACTACAAAACCCTGGAAAG//SEO ID N0:4408 
PLACE1003573//GGTGGMGTAMTAAGACT//SE0 ID N0:4409 
PLACE 1004076/ /CTCAGCCTTCCAAGTAGCAG//SEO ID N0:4410 
PLACE 1004 1 99/ /GGAATCTGGGAGTCAAAATC//SEO ID N0:4411 
PLACE 1004305//GCACCACTTTCGTCTTGAGC//SEQ 10 NO:4412 
FLACE1004450//GGCACTTAGCTTCTTGTTTT//SEO ID N0:4413 
PLACE 1 004492/ /TGGGCATCAATAAACACCIC//SEO ID N0:4414 
PLACEI004630//TGCniIGTATGGCCTGGGA//SE0 ID N0:4^15 
PLACE 1 0048 1 6/ / ATGTGGACAGGCAA.GCAGAG //SEQ 10 NO:4416 
PLACE 1 00503 1 / / GGTGTTCCCCTTCTTGITTG//SEQ ID N0:44I7 
PLACE 1 005539/ / AGAGGTAGAATGAGGGGAAA//SEQ ID N0:4418 
PLACE1005569/7CTGAACCCTGGCCTGTGAAA//SE0 ID >J0:44I9 
PLACE! 00560 1 //AGATGGGGACTATGAAGAGG//SEO ID N0:4420 
PLACE1005735//CCTGTAATCCCAGCACTTTG//SEO ID NC:4421 
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PWCEl0058l5//AGAGACAGACIinGCTCTT//SE0 ID HQA422 
PLACE 1 00592 7/ / GGTGTAGC CCATTG T.AGAGG // S EO ID N0:4423 
PLACE 1 006G7 1 //TTTGCTCTTTAACTCTGCCT//SEQ ID NO:4424 
PLACE 1 006073/ /TAAAGGGCAGGGTGGAGGAA//SEQ 10 N0:4425 
PLACE 1006079//TGATGTCCACTGTC1TATTG//SEQ 10 NC:4426 
PLACE 1 006756/ / CAAGGAGGTAATAGGGAG AT//SEQ ID N0:4427 
PLACE 1 0O7077//ATGTGCTGGTAAGTGAAATC//SEQ ID N0:4428 
PLACE 1 007061 / /GCCGGCCAAGTTTACACGAA//SEQ ID N0:4429 
PLACE1 007845/ /TAACCGCACTATGAGGAAAG//SEO ID NO:4430 
PLACE 1 00797 1 //GACAATTCAACTGGAAGACC//5EQ ID N0:443l 
PLACE 1 0082B2//CGTGATGACTGCCAACTCCA// SEQ ID NO:4432 
PLAC-:t008359//AACAGGGTCCCATTTAmG//SEO ID N0:4433 
PLACE 1008469//TTGTCATCCTCCTGCCTCTG//SEQ ID N0:4434 
PlACE 1008657//CTCAGCCTTCCAAGTAGCAG//SEQ 10 N0:4435 
PLACE 1008744//TAAACAAGACCCAGCACCAT//SEO ID NO:4436 
PLACE 1 0059 34//TCGAGACCGCTTCCCATAGA//SEQ ID N0:4437 
PLACE 1 009546//AGGTAGCCGATGACAAAGCC//SEO ID NQ:4438 
PLACE 1 0 1 0C 1 1 //GGATAIAAAGACAAGGATGC// SEQ ID N0:4439 
PLACE10107 1 3//TCAATAGCTACCTGTAGAGT//5E0 ID N0:4440 
PLA.CE101 1 C1 9//AAGGAGTGAAGCTCCATGCC//5EQ ID N0:4441 
FLACE10M I I6//CCATTACAAACCCTTTAACC//SE0 ID N0:4442 
PLACE 1011181 //TGTTATCCATTGnCACCTG//SEQ ID NG:4443 
PLACE101 1 364//TGTTGACGCTGATTTAGGCA//SE0 ID N0:4444 
PLACE30002 1 3//TAAACAAGACCCAGCACCAT//SEQ ID NQ:4445 
PLACE4000354//TAAACAAGACCCAGCACCAT//SE0 ID N0:4446 
PLACE 400G455//CA4GGAGGTAATAGGGAGAT//SEQ ID N0:4447 
SKNMC10QC01 4//CGACACAGGGTCTTGGTT7G/ /SEQ ID NO:4445 
SKNMC100C082//TTTCCTTCGGCTGGTATGGC//SEQ ID N0:4449 
Y79AA1000C30//TCCTACGITGTGGTGCAGTG//S:O ID N0:4450 
Y79AA1000037//AGCACCCCTTATTGGAAGAT//SEQ ID NO:4451 
Y79AA1 000 127/ /GATGCTTACTCCTCTTTGGT//SEQ ID N0;4452 
Y79M1 000226/ /TGCTGATGCCTC 'GTTCCCT/ /SEQ <D N0;4453 
Y79AA 1 00C2 7 0// G AAAC AACCAAGC ACCCCAT //S E Q ID N0;4454 
Y79AA 1 000 750//ATAAGGGCAGCTGGGAAGTG/ /SEQ ID N0:4455 
Y79AA: 000776/ /TGTTAGTAGCAGGAGGAAGC//SEQ ID NO:4456 
Y79AA.:000777//ACAGAGTTCAGTCCCCnA//SEQ ID N0:4457 
Y7SAA1 000876/ /GTTAC-AGGACACTGGCATCA/ /SEQ 0 NO:4458 
Y79AA1 000888/ /ACTGACTTCAGGAATAAGCC//SEG ID N0;445S 
Y79AA1 000959/ /ACAGTGAGAGCATGCGAGGT//SEQ ID NO:4460 
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Y79M1000967//GGCACAGACACCATCCTTGA//5EQ 10 N0;4461 
Y79AA 1 00 1 C56//ACAAATGAGCCTGAAAAGTC//S tO 10 N0:4452 
Y79AA1 00 1 062/ /TGGTCCTCACTGCCTTCAAA//SEQ 10 N0:4463 
Y79AA1 00 1 C90//AGTGCCCTCAAAGCTCCAGT//SEQ 10 N0:4464 
Y79AA 1 00 1 2 1 2// ACGAAAGCACTCAAATGTCA//SEQ ID NO:4465 
Y79.AA 1 00 1 272// GMTGAAATGTGGTTGAGCA/ /SEQ 10 N0:4466 
Y79AA 1 00 1 426// AATGATTCGGGGCAGCAGGA//SEQ 10 N0:4457 
Y79AA 1 0C 1 42 7//GAGAGAGACACACACAGAAA//SEQ ID N0:4468 
Y79AA1 00 1 523// AGTTTTATACCAGC ATTGGC //SEQ ID NO:4469 
Y79AA 1 00 1 530// GGTGTAGAAGTAAAATGGGA// SEQ ID NO:4470 
Y79AA1 001 592//GATTGTGTTCTCTTACTCCT//S':0 ID N0:447i 
Y79AA1001727//GCTCCACCTGACGTTCTTTA//SEQ ID N0:4472 
Y79AA100I795//GTCTCCCATATCGCTGTCTT//SEQ ID N0:4473 
Y79AA1001803//CACTTTCTAATAACCCTTGG//SEQ ID N0:4474 
Y79AA 1001 863/ /TTGGGATTGGAAACCGCATT/ /SEQ ID N0:4475 
Y79AA1 001 874// AGAAACCACTGAGGCCCAAG/ /SEQ ID NO:4476 
Y79AA1 002058/ /CAGAAGCAGAAGCAGGAGCA//SEQ ID NO:4477 
Y79AA1002i2i//ATTTACTGCGATTTCTCCTG//SEQ 10 N0:4478 
Y79AA1 002 1 29//GAGTTTCTTTGCTAGTTCCA/ /SEQ ID N0:4479 
Y79AA1002334//ATATTTGTGTTTGCCTTGGG//SEQ ID N0:4480 
Y79AA1 002373/ / GGATGGCTGGGTCAAATGGT//SEQ ID N0:4481 
Y79AA 1 002376//AATGATGGCTAGGGTGACTT/ /SEQ ID N0:4482 
Y79AA1002378//TCTTCCCACATTCGTTACAC//SEQ ID N0:4483 
Y79AA 1 00238 1 // AGGGAGTAGATGTTGGTAAA//SEQ ID N0:4484 

[0279] The result of expression frequency analysis is shown in Table 367. Only clones with correlation coefficient of 
0.9 or higher are indicated in this Table. Clones that are not presented in the Table include clones for which the assay 
could not performed because of low expression levels thereof in internal standard samples or because of unexpectedly 
smaller or larger sizes of the PCR products. 

[0280] Among the clones that could be analyzed, clones of which expression levels increased by two fold in response 
to the IL-1. stimulation 1 or 7 hours after the stimulation are: NT2RM2000514, NT2RP3001159. MAMMA1001237and 
MAMMA1000614. 

[0281] Clones of which expression levels increased by two fold in response to the TNF-stimulation 1 , 3 or 7 hours 
after the stimulation are: 

NT2RM2000582, NT2RM2002109, NT2RP1000679, NT2RP2003664, NT2RP2005597, NT2RP2004108 
NT2RP3001592, NT2RP3002738, NT2RP3004133, NT2RP3004321 , NT2RP3004557,' NT2RP3004294' 
MAMMA1001237, MAMMA1000141, MAMMA1000788, MAMMA1 002070, PLACE1002547, PLACE1003573 
PLACE1 004305, PLACE1008744, PLACE1011181, PLACE1010713. PLACE1010011 Y79AA1000776 
Y79AA1002129, 

[0282] Among the clones of which expression levels increased in response to IL-1 . stimulation, MAMMA1001237 
was a clone of which expression level was varied in response to TNF-. stimulation. Among clones showing higher 
expression levels (with relative value of 5 or higher) prior to the stimulation, PLACE1 002080 is an example of clones 
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of which expression was suppressed by the stimulation. The expression of the clone decreased by three or more fold 
in response to the stimulation. These genes were found to be associated with inflammatory reaction induced by IL-1 . 
or TNF-.. 

[0283] In Example 1 5, the genes of which expression levels were varied by culturing in the presence of TNF-. were 
analyzed by hybridization with high-density DNA filter. As for 3 clones (NT2RP3004557, NT2RP3004294 and PLACE 
1 002547), the results obtained by ATAC-PCR method were similar to those obtained by hybridization method. However, 
the results obtained by ATAC-PCR method were not necessarily consistent with those obtained by the hybridization 
method. Possible reasons for the inconsistency are the difference in cells used between the two experiments, unavail- 
ability of some data in the ATAC-PCR experiment, and the difference in the method of data treatment. 
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Table 28 

Expression of each cDNA in human tissues (The Table also contains clones with no 
15 description in Examples) 
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79.670 


40. 928 


BRAWH 1000029 


106. 376 


49.221 


55. 840 


40.856 


59. 552 


56.487 


64.886 


100. 132 


AAA Ml 1 ft) A AAA A A 

BRAWH1 000040 


29.419 


16.761 


31. 101 


15.622 


30. 633 


18. 200 


17.998 


15. 196 


A n ft, hfttl ft ft A AAA f* A 

BRAWH1 000050 


161. 264 


71.786 


118. 976 


51.863 


61.542 


97.720 


81.271 


69.194 


AAA Iftn J * A A A A ft* 4 

BRAWH 1000051 


74. 067 


34.341 


44. 047 


20. 726 


30.434 


42.055 


53.856 


24.624 


unniui i uuuvdv 


fiR 7ftQ 

OO. (Oj 


LL. 930 


J j. u 1 c 


k ini 
1 b. 493 


19. 12* 


38. 662 


A l A A 4 

34. 923 


28. 094 


BRAWH 1000075 


17.318 


16.898 


36.437 


8.901 


18.133 


17.219 


| 9.321 


11.200 


BRAWH 1000081 


43.025 


12.998 


28. 267 


7.655 


123.677 


17.673 


15.924 


9.844 


BRAWH 1000084 


174. 384 


42. 178 


80. 534 


47.752 


152. 188 


77. Ill 


110. 1S7 


102. 296 


BRAWH 10000 95 


118.239 


59. 676 


64. 528 


28. 174 


116.975 


53.814 


746. 700 


35. 985 


BRAWH1 000096 


145. 112 


44. 96? 


85. 882 


27.491 


145.013 


52. 880 


52.427 


58.678 


BRAWH 1000097 


95.841 


1 72.506 


174.954 


65.637 


64. 200 


73.707 


63.827 


63.762 


BRAWH1000100 


11.943 


19.037 


18.950 


13.536 


92. 145 


16. 582 


16.646 


10.218 


BRAWH100010) 


134.838 


57.23? 


106.632 


40. 741 


96.395 


71.642 


88.432 


57. 336 
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NASE 1)7/1 .00E-77//359aa//44%//Q14012 
C-Y79AA1001013 

C-Y79AA1001056//Homo sapiens MAID protein mRNA, complete cds.//07/1475bp//99%//AF1 13535 
C-Y79AA1001062//TUMOR NECROSIS FACTOR, ALPHA-INDUCED PROTEIN 1, ENDOTHELIAL (B12 PRO- 
5 TE I N) 7/8 . 90 E- 1 2//1 32aa//38%//Q 1 3829 

C-Y79AA1001090//NUCLEAR FACTOR NF-KAPPA-B P105 SUBUNIT (DNA-BINDING FACTOR KBF1) (EBP- 1) 
(NF-KAPPA-B1 P84/NF-KAPPA-B1 P98) [CONTAINS: NUCLEAR FACTOR NF- KAPPA-B P50 SUBUNIT] (FRAG- 
MENT) .//4.50E-09//1 44aa//3 1 %//Q63369 

C-Y79AA1001212//Homo sapiens SL15 protein mRNA, complete cds.//6.30E-306//1388bp//99%//AF038961 
w C-Y79AA1001264//HYPOTHETICAL 39.9 KD PROTEIN T15H9.1 IN CHROMOSOME II PRECURSOR7/5.10E- 

1 06//351 aa//58%//Q1 0005 

C-Y79AA1001272//Homo sapiens retinoic acid repressible protein (RARG-1) mRNA, complete cds.//1 .50E-183// 
867bp//98%//AF1 72066 

C-Y79AA1001328//Mus musculus mRNA for DII3 protein, complete cds.//1.90E-263//1988bp//79%//AB013440 
15 C-Y79AA1001426//ANION EXCHANGE PROTEIN 3 (CARDIAC/BRAIN BAND 3-LIKE PROTEIN) (CAE3/BAE3).// 

6.20E-66//609aa//31 %//P48751 

C-Y79AA1001427//Homo sapiens cytochrome b5 reductase 1 (B5R.1) mRNA, complete cds.//0//1588bp//99%// 
AF1 69481 

C-Y79AA1001430//Homo sapiens mRNA for KIAA0469 protein, complete cds.//0//2943bp//99%//AB007938 
20 C-Y79AA1001523//Homo sapiens transcriptional intermediary factor 1 alpha mRNA, complete cds.//0//2263bp// 

99%//AF1 19042 

C-Y79AA1001530//Human beta-tubulin gene (5-beta) with ten Alu family members .//0//1 920bp//98%//X00734 
C-Y79AA1001592 

C-Y79AA1001727//CELL SURFACE A33 ANTIGEN PRECURSOR.//! .10E-13//286aa//27%//Q99795 
25 C-Y79AA1001787//PROBABLE CALCIUM-TRANSPORTING AT PAS E 9 (EC 3.6.1 .38)7/1 .70E-133//544aa//37%// 

Q12697 

C-Y79AA1 001 793//Mus musculus mRNA for GSG1 , complete cds.//3.70E-1 26//532bp//78%//D87325 
C-Y79AA1001795//Homo sapiens mRNA for Ga(T4 protein 7/2.30E-250//1137bp//99%//Y1 5061 
C-Y79AA1001799//MITOCHONDRIAL RNA SPLICING PROTEIN MSR47/3.40E-54//182aa//39%//P23500 
30 C-Y79AA1001803//Homo sapiens secretogranin III mRNA, complete cds7/0//1871bp//99%//AF078851 

C-Y79AA1001863 

C-Y79AA1002022//POLIOVIRUS RECEPTOR HOMOLOG PRECURSOR.//2.20E-06//140aa//26%//P32507 
C-Y79AA100205B//Mus musculus Gng3lg mRNA, complete cds7/4.10E-167//1145bp//83%//AF069954 
C-Y79AA1002121//HISTONE H1 7/4.90E-12//114aa//35%//P35060 
35 C-Y79AA1002129 

C-Y79AA1 00221 3//HYPOTHETICAL 52.7 KD PROTEIN C38C10.2 IN CHROMOSOME IU7/1.20E-98//262aa// 
41%//Q03567 

C-Y79AA1002334//GLUCOSE REPRESSION MEDIATOR PROTEIN7/1 .70E-1 0//333aa//23%//P14922 
C-Y79AA1002373//Mus musculus mRNA for GSG1 , complete cds.//7.20E-147//680bp//79%//D87325 
40 C-Y79AA1 002376//Rattus norvegicus cytoplasmic dynein intermediate chain 2B mRNA, complete cds.//1.50E- 

304//1 667bp//90%//U39045 

C-Y79AA1002378//Homo sapiens zinc finger protein NY-REN-21 antigen mRNA, partial cds7/0//963bp//99%// 
AF155100 

C-Y79AA1 002381 //Homo sapiens cell cycle related kinase mRNA, complete cds.//0//1791bp//98%//AF035013 

45 

Claims 

1 . Use of an oligonucleotide as a primer for synthesizing the polynucleotide comprising the nucleotide sequence set 
50 forth in any one of SEQ ID NOs: 1 -829 and 2545, or the complementary strand thereof, wherein said oligonucleotide 

is complementary to said polynucleotide or the complementary strand thereof and comprises at least 15 nucle- 
otides. 

2. A primer set for synthesizing polynucleotides, the primer set comprising an oligo-dT primer and an oligonucleotide 
55 complementary to the complementary strand of the polynucleotide comprising the nucleotide sequence set forth 

in any one of SEQ ID NOs: 1-829 and 2545, wherein said oligonucleotide comprises at least 15 nucleotides. 

3. A primer set for synthesizing polynucleotides, the primer set comprising a combination of an oligonucleotide com- 
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prising a nucleotide sequence complementary to the complementary strand of the polynucleotide comprising a 5'- 
end nucleotide sequence and an oligonucleotide comprising a nucleotide sequence complementary to the poly- 
nucleotide comprising a 3'-end nucleotide sequence, wherein said oligonucleotides comprise at least 15 nucle- 
otides and wherein said combination of 5'-end nucleotide sequence.3'-end nucleotide sequence is selected from 
the group consisting of: 



SEQ ID NO:4 and SEQ ID NO:830 
SEQ ID NO:5 and SEQ ID NO:831 
SEQ ID NO:6 and SEQ ID NO:832 
SEQ ID NO:7 and SEQ ID NO:833 
SEQ ID NO: 8 and SEQ ID NO: 834 
SEQ ID NO:9 and SEQ ID NO:835 
SEQ ID NO:l 1 and SEQ ID NO:836 
SEQ ID NO: 12 and SEQ ID NO:837 
SEQ ID NO: 13 and SEQ ID NO:838 
SEQ ID NO: 14 and SEQ ID NO:839 
SEQ ID NO: 15 and SEQ ID NO:840 
SEQ ID NO: 16 and SEQ ID NO:841 
SEQ ID NO: 17 and SEQ ID NO:842 
SEQ ID NO: 18 and SEQ ID NO: 843 
SEQ ID NO: 20 and SEQ ID NO:844 
SEQ ID NO: 22 and SEQ ID NO: 845 
SEQ ID NO:23 and SEQ ID NO:846 
SEQ ID NO:24 and SEQ ID NO:847 
SEQ ID NO:25 and SEQ ID NO:848 
SEQ ID NO:26 and SEQ ID NO:849 
SEQ ID NO:27 and SEQ ID NO:850 
SEQ ID NO:28 and SEQ ID NO:85 1 
SEQ ID NO:29 and SEQ ID NO:852 
SEQ ID NO:30 and SEQ ID N0.853 
SEQ ID NO:3 1 and SEQ ID NO:854 
SEQ ID NO:32 and SEQ ID N0.855 
SEQ ID NO:33 and SEQ ID NO:856 
SEQ ID NO:34 and SEQ ID NO:857 
SEQ ID NO:35 and SEQ ID N0.858 
SEQ ID NO:36 and SEQ ID NO:859 
SEQ ID NO:37 and SEQ ID NO:860 
SEQ ID NO:39 and SEQ ID NO:861 
SEQ ID NO:40 and SEQ ID NO:862 
SEQ ID NO:41 and SEQ ID NO: 863 
SEQ ID NO:42 and SEQ ID NO: 864 
SEQ ID NO:44 and SEQ ID NO:865 
SEQ ID NO:45 and SEQ ID NO:866 
SEQ ID NO:46 and SEQ ID NO:867 
SEQ ID NO:47 and SEQ ID NO:868 
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SEQ ID NO:48 and SEQ ID NO:869 
SEQ ID NO:49 and SEQ ID NO:870 
SEQ ID NO:50 and SEQ ID NO:871 
SEQ ID NO:52 and SEQ ID NO:872 
SEQ ID NO:53 and SEQ ID NO:873 
SEQ ID N0.54 and SEQ ID NO:874 
SEQ ID NO:55 and SEQ ID NO:875 
SEQ ID NO:56 and SEQ ID NO:876 
SEQ ID NO: 57 and SEQ ID NO: 877 
SEQ ID NO:58 and SEQ ID NO:878 
SEQ ID NO:59 and SEQ ID NO:879 
SEQ ID NO:60 and SEQ ID NO:880 
SEQ ID N0:61 and SEQ ID NO:881 
SEQ ID NO:62 and SEQ ID NO:882 
SEQ ID NO:63 and SEQ ID NO:883 
SEQ ID NO:64 and SEQ ID NO:884 
SEQ ID NO:65 and SEQ ID NO:885 
SEQ ID NO:66 and SEQ ID NO:886 
SEQ ED NO:67 and SEQ ID NO:887 
SEQ ID NO:69 and SEQ ID NO:888 
SEQ TD NO:70 and SEQ ID NO:889 
SEQ ID N0:71 and SEQ ID NO:890 
SEQ ID NO:72 and SEQ ID NO:89l 
SEQ ID NO:73 and SEQ ID NO:892 
SEQ ID NO:74 and SEQ ID NO:893 
SEQ ID NO:75 and SEQ ID NO:894 
SEQ ID NO:76 and SEQ ID NO:895 
SEQ ID NO:77 and SEQ ID NO:896 
SEQ ID NO:78 and SEQ ID NO:897 
SEQ ID NO:79 and SEQ ID NO:898 
SEQ ID NO:80 and SEQ ID NO:899 
SEQ ID N0:81 and SEQ ID NO:900 
SEQ ID NO: 82 and SEQ ID NO:901 
SEQ ID NO:83 and SEQ ID NO:902 
SEQ ID NO:84 and SEQ ID NO:903 
SEQ ID NO:85 and SEQ ID NO:904 
SEQ ID NO:86 and SEQ ID NO:905 
SEQ ID NO:87 and SEQ ID NO:906 
SEQ ID N0.88 and SEQ ID NO:907 
SEQ ID NO:89 and SEQ ID NO:908 
SEQ ID NO:90 and SEQ ID NO:909 
SEQ ID N0:91 and SEQ ID NO:910 
SEQ ID NO:92 and SEQ ID N0:9 1 1 
SEQ ID N0:93 and SEQ ID NO:912 
SEQ ID NO:94 and SEQ ID NO:913 
SEQ ID NO:95 and SEQ ID NO:914 
SEQ ID NO:96 and SEQ ID NO:915 
SEQ ID NO:97 and SEQ ID NO:916 
SEQ ID NO:98 and SEQ ID NO:917 
SEQ ID NO:99 and SEQ ID NO:918 
SEQ ID NO: 100 and SEQ ID NO:919 
SEQ ID NO: 101 and SEQ ID NO:920 
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SEQ ID NO: 
SEQ ID NO: 
SEQ ID NO: 
SEQ ID NO: 
SEQ ID NO: 
SEQ ID NO: 
SEQ ID NO: 
SEQ ID NO: 
SEQ ID NO: 
SEQ ID NO: 
SEQ ID NO: 
SEQ ID NO: 
SEQ ID NO: 
SEQ ID NO: 
SEQ ID NO: 
SEQ ID NO: 
SEQ ID NO: 
SEQ ID NO: 
SEQ ID NO: 
SEQ ID NO: 
SEQ ID NO: 
SEQ ID NO: 
SEQ ID NO: 
SEQ ID NO: 
SEQ ID NO: 
SEQ ID NO: 
SEQ ID NO: 
SEQ ID NO: 
SEQ ID NO: 
SEQ ID NO: 
SEQ ID NO: 
SEQ ID NO: 
SEQ ID NO: 
SEQ ID NO: 
SEQ ID NO: 
SEQ ID NO: 
SEQ ID NO: 
SEQ ID NO: 
SEQ ID NO: 
SEQ ID NO: 
SEQ ID NO: 
SEQ ID NO: 
SEQ ID NO: 
SEQ ID NO: 
SEQ ID NO: 
SEQ ID NO: 
SEQ ID NO 
SEQ ID NO; 
SEQ ID NO 
SEQ ID NO 
SEQ ID NO 
SEQ ID NO 



02 


and 


SEQ 


03 


and 


SEQ 


04 


and 


SEQ 


05 


and 


SEQ 


06 


and 


SEQ 


07 


and 


SEQ 


08 


and 


SEQ 


09 


and 


SEQ 


10 


and 


SEQ 


11 


and 


SEQ 


12 


and 


SEQ 


13 


and 


SEQ 


14 


and 


SEQ 


15 


and 


SEQ 


16 


and 


SEQ 


17 


and 


SEQ 


18 


and 


SEQ 


19 


and 


SEQ 


20 


and 


SEQ 


22 


and 


SEQ 


23 


and 


SEQ 


24 


and 


SEQ 


25 


and 


SEQ 


26 


and 


SEQ 


27 


and 


SEQ 


28 


and 


SEQ 


29 


and 


SEQ 


30 


and 


SEQ 


31 


and 


SEQ 


32 


and 


SEQ 


33 


and 


SEQ 


34 


and 


SEQ 


35 


and 


SEQ 


36 


and 


SEQ 


37 


and 


SEQ 


38 


and 


SEQ 


39 


and 


SEQ 


40 


and 


SEQ 


41 


and 


SEQ 


42 


and 


SEQ 


43 


and 


SEQ 


44 


and 


SEQ 


45 


and 


SEQ 


46 


and 


SEQ 


47 


and 


SEQ 


48 


and 


SEQ 


49 


and 


SEQ 


50 


and 


SEQ 


51 


and 


SEQ 


52 


and 


SEQ 


53 


and 


SEQ 


.54 


and 


SEQ 



IDNO:921 
ID NO:922 
ID NO:923 
ID NO:924 
ID NO:925 
ID NO:926 
ID NO:927 
ID NO:928 
ID NO:929 
ID NO:930 
IDNO:931 
ID NO:932 
ID NO:933 
ID NO:934 
ID NO:935 
ID NO:936 
ID NO:937 
ID NO:938 
ID NO:939 
ID NO:940 
IDNO:941 
ID NO:942 
ID NO:943 
ID NO:944 
ID NO:945 
ID NO:946 
ID NO:947 
ID NO:948 
ID NO:949 
ID NO:950 
ID NO:951 
ID NO:952 
ID NO:953 
ID NO:954 
ID NO:955 
ID NO:956 
ID NO:957 
ID NO:958 
ID NO:959 
ID NO:960 
IDNO:961 
ID NO:962 
ID NO:963 
ID NO:964 
ID NO:965 
ID NO:966 
ID NO:967 
ID NO:968 
ID NO:969 
ID NO:970 
ID NO:971 
ID NO:972 



1356 



EP1 130 094 A2 



SEQ ID NO: 155 and SEQ ID NO:973 
SEQ ID NO: 156 and SEQ ID NO:974 
SEQ ID NO: 157 and SEQ ID NO:975 
SEQ ID NO:158 and SEQ ID NO:976 
SEQ ID NO: 159 and SEQ ID NO:977 
SEQ ID NO: 160 and SEQ ID NO:978 
SEQ ID N0:16l and SEQ ID NO:979 
SEQ ID NO:162 and SEQ ID NO:980 
SEQ ID NO: 163 and SEQ ID NO:981 
SEQ ID NO: 164 and SEQ ID NO:982 
SEQ ID NO: 165 and SEQ ID NO:983 
SEQ ID NO: 166 and SEQ ID NO:984 
SEQ ID NO: 167 and SEQ ID NO:985 
SEQ ID NO: 168 and SEQ ID NO:986 
SEQ ID NO: 169 and SEQ ID NO:987 
SEQ ID NO: 170 and SEQ ID NO:988 
SEQ ID NO: 171 and SEQ ID NO:989 
SEQ ID NO: 172 and SEQ ID NO:990 
SEQ ID NO: 173 and SEQ ID NO:991 
SEQ ID NO: 174 and SEQ ID NO:992 
SEQ ID NO: 175 and SEQ ID NO:993 
SEQ ID NO: 176 and SEQ ID NO:994 
SEQ ID NO: 177 and SEQ ID NO:995 
SEQ ID NO: 178 and SEQ ID NO. 996 
SEQ ID NO: 179 and SEQ ID NO:997 
SEQ ID NO: 180 and SEQ ID NO:998 
SEQ ID NO: 181 and SEQ ID NO:999 
SEQ ID NO: 182 and SEQ ID NO:1000 
SEQ ID NO: 183 and SEQ ID NO: 1001 
SEQ ID NO: 184 and SEQ ID NO: 1002 
SEQ ID NO: 185 and SEQ ID NO: 1003 
SEQ ID NO: 186 and SEQ ID NO: 1004 
SEQ ID NO: 187 and SEQ ID NO: 1005 
SEQ ID NO: 1 88 and SEQ ID NO: 1 006 
SEQ ID NO: 189 and SEQ ID NO: 1007 
SEQ ID NO: 190 and SEQ ID NO: 1008 
SEQ ID NO: 191 and SEQ ID NO: 1009 
SEQ ID NO:193 and SEQ ID NO: 1010 
SEQ ID NO: 194 and SEQ ID NO: 101 1 
SEQ ID NO: 195 and SEQ ID NO: 1012 
SEQ ID NO: 196 and SEQ ID NO: 101 3 
SEQ ID NO: 197 and SEQ ID NO: 1014 
SEQ ID NO: 1 98 and SEQ ID NO: 1 0 1 5 
SEQ ID NO:199 and SEQ ID NO:1016 
SEQ ID NO:200 and SEQ ID NO: 1017 
SEQIDNO:201 and SEQ ID NO: 1018 
SEQ ID NO:202 and SEQ ID NO: 1019 
SEQ ID NO.203 and SEQ ID NO: 1020 
SEQ ID NO:204 and SEQ ID NO: 102 1 
SEQ ID NO:205 and SEQ ID NO: 1022 
SEQ ID NO:207 and SEQ ID NO: 1023 
SEQ ID NO:208 and SEQ ID NO: 1024 
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SEQ ID NO:209 and SEQ ID NO: 1 025 
SEQ ID N0:2 10 and SEQ ID NO: 1026 
SEQ ID NO:2 1 1 and SEQ ID NO: 1 027 
SEQ ID NO:212 and SEQ ID NO: 1028 
SEQ ID NO:213 and SEQ ID NO: 1029 
SEQ ID NO:214 and SEQ ID NO: 1030 
SEQ ID NO:215 and SEQ ID NO: 1031 
SEQ ID NO: 2 1 6 and SEQ ID NO: 1 032 
SEQ ID NO:248 and SEQ ID NO: 1033 
SEQ ID NO:249 and SEQ ID NO: 1034 
SEQ ID NO:250 and SEQ ID NO:1035 
SEQ ID NO:251 and SEQ ID NO: 1036 
SEQ ID NO:252 and SEQ ID NO: 1037 
SEQ ID NO:253 and SEQ ID NO: 1038 
SEQ ID NO:254 and SEQ ID NO: 1039 
SEQ ID NO:255 and SEQ ID NO: 1040 
SEQ ID NO:256 and SEQ ID NO: 1041 
SEQ ID NO:257 and SEQ ID NO: 1 042 
SEQ ID NO:258 and SEQ ID NO: 1043 
SEQ ID NO:259 and SEQ ID NO: 1044 
SEQ ID NO:260 and SEQ ID NO: 1045 
SEQ ID NO: 261 and SEQ ID NO: 1046 
SEQ ID NO:262 and SEQ ID NO: 1047 
SEQ ID NO:263 and SEQ ID NO: 1048 
SEQ ID NO:264 and SEQ ID NO: 1049 
SEQ ID NO:265 and SEQ ID NO: 1050 
SEQ ID NO:266 and SEQ ID NO: 1051 
SEQ ID NO:267 and SEQ ID NO: 1052 
SEQ ID NO:268 and SEQ ID NO: 1053 
SEQ ID NO:291 and SEQ ID NO: 1054 
SEQ ID NO:292 and SEQ ID NO: 1055 
SEQ ID NO:293 and SEQ ID NO: 1056 
SEQ ID NO:294 and SEQ ID NO: 1057 
SEQ ID NO:295 and SEQ ID NO:1058 
SEQ ID NO:296 and SEQ ID NO: 1059 
SEQ ID NO:297 and SEQ ID NO: 1060 
SEQ ID NO:298 and SEQ ID NO: 1061 
SEQ ID NO:300 and SEQ ID NO: 1062 
SEQ ID NO:301 and SEQ ID NO: 1063 
SEQ ID NO:302 and SEQ ID NO: 1064 
SEQ ID NO:304 and SEQ ID NO: 1065 
SEQ ID NO:305 and SEQ ID NO: 1066 
SEQ ID NO: 306 and SEQ ID NO: 1067 
SEQ ID NO:307 and SEQ ID NO: 1068 
SEQ ID NO:308 and SEQ ID NO: 1069 
SEQ ID NO:309 and SEQ ID NO: 1070 
SEQ ID NO:310 and SEQ ID NO: 1071 
SEQ ID NO:31 1 and SEQ ID NO: 1072 
SEQ ID NO:312 and SEQ ID NO: 1073 
SEQ ID NO:313 and SEQ ID NO.1074 
SEQ ID NO:314 and SEQ ID NO: 1075 
SEQ ID NO:315 and SEQ ID NO: 1076 
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SEQ ID NO:317 and SEQ ID NO: 1077 
SEQ ID NO:318 and SEQ ID NO: 1 078 
SEQ ID NO:319 and SEQ ID NO: 1079 
SEQ ID NO:320 and SEQ ID NO: 1080 
SEQ ID NO:321 and SEQ ID NO:1081 
SEQ ID NO:322 and SEQ ID NO: 1082 
SEQ ID NO:323 and SEQ ID NO: 1083 
SEQ ID NO:324 and SEQ ID NO: 1084 
SEQ ID NO:325 and SEQ ID NO: 1085 
SEQ ED NO:326 and SEQ ID NO: 1086 
SEQ ID NO:327 and SEQ ID NO: 1087 
SEQ ID NO:328 and SEQ ID NO: 1088 
SEQ ID NO:329 and SEQ ID NO: 1089 
SEQ ID NO:330 and SEQ ID NO: 1090 
SEQ ID NO:331 and SEQ ID NO:1091 
SEQ ID NO:332 and SEQ ID NO: 1092 
SEQ ID NO:333 and SEQ ID NO: 1093 
SEQ ID NO:334 and SEQ ID NO: 1094 
SEQ ID NO:335 and SEQ ID NO: 1095 
SEQ ID NO:336 and SEQ ID NO: 1096 
SEQ ID NO:337 and SEQ ID NO: 1097 
SEQ ID NO:338 and SEQ ID NO: 1098 
SEQ ID NO:339 and SEQ ID NO: 1099 
SEQ ID NO:340 and SEQ ID NO: 1 100 
SEQ ID NO:341 and SEQ ID NO: 1 101 
SEQ ID NO:342 and SEQ ID NO: 1 102 
SEQ ID NO:343 and SEQ ID NO: 1 103 
SEQ ID NO:344 and SEQ ID NO: 1 104 
SEQ ID NO:345 and SEQ ID NO: 1 105 
SEQ ID NO: 347 and SEQ ID NO: 1 106 
SEQ ID NO:348 and SEQ ID NO: 1 107 
SEQ ID NO:349 and SEQ ID NO: 1 108 
SEQ ID NO: 350 and SEQ ID NO: 1 109 
SEQ ID NO:351 and SEQ ID NO:l 1 10 
SEQ ID NO:352 and SEQ ID NO: 1 1 1 1 
SEQ ID NO:353 and SEQ ID NO: 1 1 1 2 
SEQ ID NO:354 and SEQ ID NO: 1 1 13 
SEQ ID NO:355 and SEQ ID NO: 1 1 14 
SEQ ID NO: 356 and SEQ ID NO: 1 1 15 
SEQ ID NO:357 and SEQ ID NO: 1 1 16 
SEQ ID NO:358 and SEQ ID NO: 1 1 17 
SEQ ID NO:359 and SEQ ID NO: 1118 
SEQ ID NO:360 and SEQ ID NO: 1119 
SEQIDNO:361 and SEQ ID NO: 1120 
SEQ ID NO:362 and SEQ ID NO: 1 121 
SEQ ID NO:363 and SEQ ID NO: 1 1 22 
SEQ ID NO:364 and SEQ ID NO: 1 1 23 
SEQ ID NO:365 and SEQ ID NO: 1 1 24 
SEQ ID NO:366 and SEQ ID NO: 1 125 
SEQ ID NO:367 and SEQ ID NO: 1 126 
SEQ ID NO:368 and SEQ ID NO: 1 127 
SEQ ID NO:369 and SEQ ID NO: 1 128 
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SEQ ID NO:370 and 
SEQIDNO:371 and 
SEQ ID NO:372 and 
SEQ ID NO:373 and 
SEQ ID NO:374 and 
SEQ ID NO:375 and 
SEQ ID NO:376 and 
SEQ ID NO:377 and 
SEQ ID NO:378 and 
SEQ ID NO:379 and 
SEQ ID NO:380 and 
SEQ IDNO:381and 
SEQ ID NO:382 and 
SEQ ID NO:383 and 
SEQ ID NO:384 and 
SEQ ID NO:385 and 
SEQ ID NO:386 and 
SEQ ID NO:387 and 
SEQ ID NO:388 and 
SEQ ID NO:389 and 
SEQ ID NO: 390 and 
SEQIDNO:391 and 
SEQ ID NO: 392 and 
SEQ ID NO:393 and 
SEQ ID NO: 394 and 
SEQ ID NO:395 and 
SEQ ID NO: 396 and 
SEQ ID NO:397 and 
SEQ ID NO:398 and 
SEQ ID NO:399 and 
SEQ ID NO:400 and 
SEQ ID NO:401 and 
SEQ ID NO:402 and 
SEQ ID NO:403 and 
SEQ ID NO:404 and 
SEQ ID NO:405 and 
SEQ ID NO:406 and 
SEQ ID NO:407 and 
SEQ ID NO:408 and 
SEQ ID NO:409 and 
SEQ ID NO:410 and 
SEQ ID N0:411 and 
SEQ IDNO:412and 
SEQIDNO:413and 
SEQ ID NO:414 and 
SEQ IDNO:415 and 
SEQ ID NO:416and 
SEQ ID NO:417and 
SEQIDNO:419and 
SEQ ID NO:420 and 
SEQIDNO:421 and 
SEQ ID NO:422 and 



SEQ ID NO: 1129 
SEQ ID NO: 11 30 
SEQ ID NO: 1131 
SEQ ID NO: 1132 
SEQ ID NO: 1133 
SEQ ID NO: 1134 
SEQ ID NO: 1135 
SEQ ID NO: 11 36 
SEQ ID NO: 11 37 
SEQ ID NO: 11 38 
SEQ ID NO: 1139 
SEQ ID NO: 1140 
SEQ ID NO: 1141 
SEQ ID NO: 1142 
SEQ ID NO: 1143 
SEQIDNO:1144 
SEQ ID NO: 1145 
SEQ ID NO: 1146 
SEQIDNO:1147 
SEQ ID NO: 1148 
SEQ ID NO: 1149 
SEQ ID NO: 1150 
SEQIDNO:1151 
SEQ ID NO: 1152 
SEQ ID NO: 1153 
SEQ ID NO: 11 54 
SEQ ID NO: 1155 
SEQ ID NO: 11 56 
SEQ ID NO: 1157 
SEQ ID NO: 1158 
SEQ ID NO: 11 59 
SEQ ID NO: 1160 
SEQ ID NO:116l 
SEQIDNO:1162 
SEQ ID NO: 11 63 
SEQIDNO:1164 
SEQ ID NO: 1165 
SEQ ID NO: 1166 
SEQ IDNO:1167 
SEQ ID NO: 11 68 
SEQ ID NO: 1169 
SEQ ID NO: 1 170 
SEQ ID NO: 1171 
SEQ ID NO: 1172 
SEQ ID NO: 1173 
SEQ ID NO: 1174 
SEQ ID NO: 1175 
SEQ ID NO: 11 76 
SEQ ID NO: 1177 
SEQ ID NO: 11 78 
SEQ ID NO: 11 79 
SEQ ID NO: 1180 
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SEQ ID NO:423 and SEQ ID NO: 1 181 
SEQ ID NO:424 and SEQ ID N0:1 182 
SEQ ID NO:425 and SEQ ID NO: 1 183 
SEQ ID NO:426 and SEQ ID NO: 1 184 
SEQ ID NO:427 and SEQ ID NO: 1 185 
SEQ ID NO:428 and SEQ ID NO: 1 186 
SEQ ID NO:429 and SEQ ID NO: 1 187 
SEQ ID NO:430 and SEQ ID NO: 1 188 
SEQ ID NO:43 1 and SEQ ID NO: 1 189 
SEQ ID NO:432 and SEQ ID NO: 1 190 
SEQ ID NO:433 and SEQ ID NO: 1 191 
SEQ ID NO:434 and SEQ ID NO: 1 192 
SEQ ID NO:435 and SEQ ID NO: 1 193 
SEQ ID NO:436 and SEQ ID NO: 1 194 
SEQ ID NO:437 and SEQ ID NO: 1 195 
SEQ ID NO:438 and SEQ ID NO: 1 196 
SEQ ID NO:439 and SEQ ED NO: 1 197 
SEQ ID NO:440 and SEQ ID NO: 1 198 
SEQ ID NO:441 and SEQ ID NO:l 199 
SEQ ID NO:442 and SEQ ID NO: 1200 
SEQ ID NO:443 and SEQ ID NO: 1201 
SEQ ID NO:444 and SEQ ID NO: 1202 
SEQ ID NO:446 and SEQ ID NO: 1203 
SEQ ID NO:447 and SEQ ID NO: 1204 
SEQ ID NO:448 and SEQ ID NO: 1205 
SEQ ID NO:450 and SEQ ID NO: 1206 
SEQ ID NO:451 and SEQ ID NO: 1207 
SEQ ID NO:452 and SEQ ID NO: 1208 
SEQ ID NO:453 and SEQ ID NO: 1209 
SEQ ID NO:454 and SEQ ID NO: 1210 
SEQ ID NO:455 and SEQ ID NO: 12 11 
SEQ ID NO:456 and SEQ ID NO: 12 12 
SEQ ID NO:457 and SEQ ID NO: 1213 
SEQ ID NO:458 and SEQ ID NO: 1214 
SEQ ID NO:459 and SEQ ID NO: 1215 
SEQ ID NO:460 and SEQ ID NO: 1216 
SEQ ID NO:461 and SEQ ID NO: 1217 
SEQ ID NO:462 and SEQ ID NO: 1218 
SEQ ID NO:463 and SEQ ID NO: 1219 
SEQ ID NO:464 and SEQ ID NO: 1220 
SEQ ID NO:465 and SEQ ID NO: 1221 
SEQ ID NO:466 and SEQ ID NO: 1 222 
SEQ ID NO:467 and SEQ ID NO: 1223 
SEQ ID NO:468 and SEQ ID NO: 1224 
SEQ ID NO:469 and SEQ ID NO: 1225 
SEQ ID NO:470 and SEQ ID NO: 1226 
SEQ ID NO:471 and SEQ ID NO: 1227 
SEQ ID NO:472 and SEQ ID NO: 1228 
SEQ ID NO:473 and SEQ ID NO: 1229 
SEQ ID NO:474 and SEQ ID NO: 1230 
SEQ ID NO:475 and SEQ ID NO: 1231 
SEQ ID NO:476 and SEQ ID NO: 1232 
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SEQ ID NO:477 and 
SEQ ID NO:478 and 
SEQ ED NO:479 and 
SEQ ID NO:480 and 
SEQIDNO:481 and 
SEQ ID NO:482 and 
SEQ ID NO:483 and 
SEQ ID NO:484 and 
SEQ ID NO:485 and 
SEQ ED NO:486 and 
SEQ ID NO:487 and 
SEQ ID NO:488 and 
SEQ ID NO:489 and 
SEQ ID NO:490 and 
SEQ ID NO:492 and 
SEQ ID NO:493 and 
SEQ ID NO:494 and 
SEQ ID NO:496 and 
SEQ ID NO:497 and 
SEQ ID NO:498 and 
SEQ ID NO:499 and 
SEQ ID NO:500 and 
SEQ ID NO:501 and 
SEQ ID NO:502 and 
SEQ ID NO: 503 and 
SEQ ID NO:504 and 
SEQ ID NO:505 and 
SEQ ID NO:506 and 
SEQ ID NO:507 and 
SEQ ID NO:508 and 
SEQ ID NO:509 and 
SEQ IDNO:510and 
SEQ IDN0:511and 
SEQIDNO:513and 
SEQIDNO:514and 
SEQIDNO:515and 
SEQIDNO:516and 
SEQIDNO:517and 
SEQIDNO:518and 
SEQIDNO:519and 
SEQ ID NO:520 and 
SEQ ID NO:521 and 
SEQ ID NO: 522 and 
SEQ ID NO:523 and 
SEQ ID NO:524 and 
SEQ ID NO:525 and 
SEQ ID NO:526 and 
SEQ ID NO:527 and 
SEQ ID NO:528 and 
SEQ ID NO:529 and 
SEQ ID NO:530 and 
SEQ IDNO:531 and 



SEQ ID NO: 1233 
SEQ ID NO: 1234 
SEQ ID NO: 1235 
SEQ ID NO: 1236 
SEQ ID NO: 1237 
SEQ ID NO: 1238 
SEQ ID NO: 1239 
SEQ ED NO: 1240 
SEQ ID NO: 1241 
SEQ ED NO: 1242 
SEQ ED NO: 1243 
SEQ ID NO: 1244 
SEQ ID NO: 1245 
SEQ ID NO: 1246 
SEQ ID NO: 1247 
SEQ ID NO: 1248 
SEQ ID NO: 1249 
SEQ ID NO: 1250 
SEQ ID NO: 1251 
SEQ ID NO: 1252 
SEQ ID NO: 1253 
SEQ ID NO: 1254 
SEQ ID NO: 1255 
SEQ ID NO: 1256 
SEQ ID NO: 1257 
SEQ ID NO: 1258 
SEQ ID NO: 1259 
SEQ ED NO: 1260 
SEQ ID NO:1261 
SEQ ID NO: 1262 
SEQ ID NO: 1263 
SEQ ID NO: 1264 
SEQ ID NO: 1265 
SEQ ID NO: 1266 
SEQ ID NO: 1267 
SEQ ID NO: 1268 
SEQ ID NO: 1269 
SEQ ID NO: 1270 
SEQIDNO:1271 
SEQ ID NO: 1272 
SEQ ID NO: 1273 
SEQ ID NO: 1274 
SEQ ID NO: 1275 
SEQ ID NO: 1276 
SEQ ID NO: 1277 
SEQ ID NO: 1278 
SEQ ID NO: 1279 
SEQ ID NO: 1280 
SEQ ID NO: 1281 
SEQ ID NO: 1282 
SEQ ID NO: 1283 
SEQ ID NO: 1284 
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SEQ ID NO:532 and SEQ ID NO: 1285 
SEQ ID NO:533 and SEQ ID NO: 1286 
SEQ ID NO:534 and SEQ ID NO: 1287 
SEQ ID NO:535 and SEQ ID NO: 1288 
SEQ ID NO:536 and SEQ ID NO: 1 289 
SEQ ID NO:537 and SEQ ID NO: 1 290 
SEQ ID N0.538 and SEQ ID NO: 1291 
SEQ ID NO:539 and SEQ ID NO: 1292 
SEQ ID NO:540 and SEQ ID NO: 1293 
SEQ ID NO:541 and SEQ ID NO: 1294 
SEQ ID NO:542 and SEQ ID NO: 1295 
SEQ ID NO:543 and SEQ ID NO: 1296 
SEQ ID NO:544 and SEQ ID NO: 1297 
SEQ ID NO:545 and SEQ ID NO: 1298 
SEQ ID NO:546 and SEQ ID NO: 1299 
SEQ ID NO:547 and SEQ ID NO: 1300 
SEQ ID NO:548 and SEQ ID NO: 1301 
SEQ ID NO:549 and SEQ ID NO: 1302 
SEQ ID NO:550 and SEQ ID NO: 1303 
SEQ ID NO:551 and SEQ ID NO: 1304 
SEQ ID NO:552 and SEQ ID NO: 1305 
SEQ ID NO:553 and SEQ ID NO: 1306 
SEQ ID NO:554 and SEQ ID NO: 1307 
SEQ ID NO:555 and SEQ ID NO: 1308 
SEQ ID NO:556 and SEQ ID NO: 1309 
SEQ ID NO:557 and SEQ ID NO:1310 
SEQ ID NO:558 and SEQ ID NO: 1 3 1 1 
SEQ ID NO:559 and SEQ ID NO: 13 12 
SEQ ID NO:560 and SEQ ID NO: 13 13 
SEQ IDNO:561 and SEQ ID NO:1314 
SEQ ID NO:562 and SEQ ID NO:1315 
. SEQ ID NO:563 and SEQ ID NO: 1 3 16 
SEQ ID NO:564 and SEQ ID NO: 13 17 
SEQ ID NO:565 and SEQ ED NO: 13 18 
SEQ ID NO:566 and SEQ ID NO:1319 
SEQ ID NO:567 and SEQ ID NO: 1320 
SEQ ID NO:569 and SEQ ID NO: 1321 
SEQ ID NO:570 and SEQ ID NO: 1 322 
SEQ ID NO:571 and SEQ ID NO:1323 
SEQ ID NO:572 and SEQ ID NO: 1324 
SEQ ID NO:573 and SEQ ID NO: 1325 
SEQ ID NO:574 and SEQ ID NO: 1326 
SEQ ID NO:575 and SEQ ID NO: 1327 
SEQ ID NO:576 and SEQ ID NO: 1328 
SEQ ID NO:577 and SEQ ID NO: 1329 
SEQ ID NO:579 and SEQ ID NO: 1330 
SEQ ID NO:580 and SEQ ID NO: 1331 
SEQ ID NO:581 and SEQ ID NO:1332 
SEQ ID NO:582 and SEQ ID NO: 1333 
SEQ ID NO:583 and SEQ ID NO: 1334 
SEQ ID NO:584 and SEQ ID NO: 1335 
SEQ ID NO:585 and SEQ ID NO: 1336 
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SEQ ID NO:586 and 
SEQ ID NO:587 and 
SEQ ID NO:588 and 
SEQ ID NO:589 and 
SEQ ID NO:590 and 
SEQIDNO:591 and 
SEQ ID NO: 592 and 
SEQ ID NO:593 and 
SEQ ID NO: 594 and 
SEQ ID NO:595 and 
SEQ ID NO:596 and 
SEQ ID NO:597 and 
SEQ ID NO: 598 and 
SEQ ID NO:599 and 
SEQ ID NO:600 and 
SEQIDNO:601 and 
SEQ ID NO:602 and 
SEQ ID NO:603 and 
SEQ ID NO:604 and 
SEQ ID NO:605 and 
SEQ ID NO:606 and 
SEQ ID NO:607 and 
SEQ ID NO:608 and 
SEQ ID NO:609 and 
SEQ ID NO:6lOand 
SEQ ID N0:611 and 
SEQ IDN0:612and 
SEQ ID NO:613and 
SEQIDNO:614and 
SEQ ID NO:615and 
SEQ ID NO:616 and 
SEQ ID N0:617 and 
SEQ IDNO:618and 
SEQ ID N0:619 and 
SEQ ID NO:620 and 
SEQ ID NO:621 and 
SEQ ID NO:622 and 
SEQ ID NO:623 and 
SEQ ID NO:624 and 
SEQ ID NO: 625 and 
SEQ ID NO:626 and 
SEQ ID NO:627 and 
SEQ ID NO:628 and 
SEQ ID NO:629 and 
SEQ ID NO:630 and 
SEQIDNO:63I and 
SEQ ID NO:632 and 
SEQ ID NO:633 and 
SEQ ID NO: 634 and 
SEQ ID NO:635 and 
SEQ ID NO:636 and 
SEQ ID NO:637 and 



SEQ ID NO: 1337 
SEQ ID NO: 1338 
SEQ ID NO: 1339 
SEQ ID NO: 1340 
SEQ ID NO: 1341 
SEQ ID NO: 1342 
SEQ ID NO: 1343 
SEQ ID NO: 1344 
SEQ ID NO: 1345 
SEQ ID NO: 1346 
SEQ ID NO: 1347 
SEQ ID NO: 1348 
SEQ ID NO: 1349 
SEQ ID NO: 1350 
SEQ ID NO: 1351 
SEQ ID NO: 1352 
SEQ ID NO: 1353 
SEQ ID NO: 1354 
SEQ ID NO: 1355 
SEQ ID NO: 1356 
SEQ ID NO: 1357 
SEQ ID NO: 1358 
SEQ ID NO: 1359 
SEQ ID NO: 1360 
SEQIDNO:1361 
SEQ ID NO: 1362 
SEQ ID NO: 1363 
SEQ ID NO: 1364 
SEQ ID NO: 1365 
SEQ ID NO: 1366 
SEQ ID NO: 1367 
SEQ ID NO: 1368 
SEQ ID NO: 1369 
SEQ ID NO: 1370 
SEQ ID NO: 1371 
SEQ ID NO: 1372 
SEQ ID NO: 1373 
SEQ ID NO: 1374 
SEQ ID NO: 1375 
SEQ ID NO: 1376 
SEQ ID NO: 1377 
SEQ ID NO: 1378 
SEQ ID NO: 1379 
SEQ ID NO: 1380 
SEQIDNO:1381 
SEQ ID NO: 1382 
SEQ ID NO: 1383 
SEQ ID NO: 1384 
SEQ ID NO: 1385 
SEQ ID NO: 1386 
SEQ ID NO: 1387 
SEQ ID NO: 1388 
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SEQ ID NO:638 and SEQ ID NO: 1389 
SEQ ID NO:639 and SEQ ID NO: 1390 
SEQ ID NO:640 and SEQ ID NO: 1391 
SEQ ID NO:641 and SEQ ID NO: 1392 
SEQ ID NO:642 and SEQ ID NO: 1393 
SEQ ID NO:643 and SEQ ID NO: 1 394 
SEQ ID NO:644 and SEQ ID NO: 1395 
SEQ ID NO:645 and SEQ ID NO: 1396 
SEQ ID NO:646 and SEQ ID NO: 1 397 
SEQ ID NO:647 and SEQ ID NO: 1398 
SEQ ID NO:648 and SEQ ID NO: 1 399 
SEQ ID NO:649 and SEQ ID NO: 1400 
SEQ ID NO:650 and SEQ ID NO: 1401 
SEQ ID NO:651 and SEQ ID NO: 1402 
SEQ ID NO:652 and SEQ ID NO: 1403 
SEQ ID NO:653 and SEQ ID NO: 1404 
SEQ ID NO:654 and SEQ ID NO: 1405 
SEQ ID NO:655 and SEQ ID NO: 1406 
SEQ ID NO:656 and SEQ ID NO: 1407 
SEQ ID NO:657 and SEQ ID NO: 1408 
SEQ ID NO:658 and SEQ ID NO: 1409 
SEQ ID NO:659 and SEQ ID NO: 1410 
SEQ ID NO:660 and SEQ ID NO: 141 1 
SEQ ID NO-.661 and SEQ ID NO: 14 12 
SEQ ID NO:662 and SEQ ID NO: 1413 
SEQ ID NO:663 and SEQ ID NO: 14 14 
SEQ ID NO:664 and SEQ ID NO: 1415 
SEQ ID NO:665 and SEQ ID NO:1416 
SEQ ID NO:666 and SEQ ID NO: 1417 
SEQ ID NO:667 and SEQ ID NO: 14 18 
SEQ ID NO:668 and SEQ ID NO: 1419 
SEQ ID NO:669 and SEQ ID NO: 1420 
SEQ ED NO:670 and SEQ ID NO: 1421 
SEQ ID NO:671 and SEQ ID NO: 1422 
SEQ ID NO:672 and SEQ ED NO: 1423 
SEQ ID NO:673 and SEQ ID NO: 1424 
SEQ ID NO:674 and SEQ ID NO: 1425 
SEQ ID NO:675 and SEQ ID NO: 1426 
SEQ ID NO:676 and SEQ ID NO: 1427 
SEQ ID NO:677 and SEQ ID NO: 1428 
SEQ ID NO:678 and SEQ ID NO: 1429 
SEQ ID NO:679 and SEQ ID NO: 1430 
SEQ ID NO:680 and SEQ ID NO: 143 1 
SEQ ID NO:681 and SEQ ID NO: 1432 
SEQ ID NO:682 and SEQ ID NO: 1433 
SEQ ID NO:683 and SEQ ID NO; 1434 
SEQ ID NO:684 and SEQ ED NO: 1435 
SEQ ID NO:685 and SEQ ID NO: 1436 
SEQ ID NO:686 and SEQ ID NO: 1437 
SEQ ID NO:687 and SEQ ID NO: 1438 
SEQ ID NO:688 and SEQ ID NO: 1439 
SEQ ID NO: 689 and SEQ ID NO: 1440 
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SEQ ID NO:690 and SEQ ID NO: 1441 
SEQ ID NO:691 and SEQ ID NO: 1442 
SEQ ID NO:692 and SEQ ID NO: 1443 
SEQ ID NO:693 and SEQ ID NO: 1444 
SEQ ID NO:694 and SEQ ID NO: 1445 
SEQ ID NO:695 and SEQ ID NO: 1446 
SEQ ID NO:696 and SEQ ID NO: 1447 
SEQ ID NO:697 and SEQ ID NO: 1448 
SEQ ID NO:698 and SEQ ID NO: 1449 
SEQ ID NO:699 and SEQ ID NO: 1450 
SEQ ID NO:700 and SEQ ID NO: 1451 
SEQ ID NO:701 and SEQ ID NO: 1452 
SEQ ID NO: 702 and SEQ ID NO: 1453 
SEQ ID NO:703 and SEQ ID NO: 1454 
SEQ ID NO:704 and SEQ ID NO:1455 
SEQ ED NO:705 and SEQ ID NO: 1456 
SEQ ID NO:706 and SEQ ID NO: 1457 
SEQ ID NO:707 and SEQ ID NO: 1458 
SEQ ID NO:708 and SEQ ID NO: 1459 
SEQ ID NO:709 and SEQ ID NO: 1460 
SEQ ID NO:710 and SEQ ID NO: 1461 
SEQ ID NO:71 1 and SEQ ID NO: 1462 
SEQ ID NO:712 and SEQ ED NO: 1463 
SEQ ID NO:713 and SEQ ID NO: 1464 
SEQ ID NO-.714 and SEQ ID NO: 1465 
SEQ ID NO:715 and SEQ ID NO: 1466 
SEQ ID NO:716 and SEQ ID NO: 1467 
SEQ ID NO:717 and SEQ ID NO:1468 
SEQ ID NO:718 and SEQ ID NO: 1469 
SEQ ID NO:719 and SEQ ID NO: 1470 
SEQ ID NO: 720 and SEQ ID NO: 1471 
SEQ ID NO: 721 and SEQ ID NO: 1472 
SEQ ID NO:722 and SEQ ID NO: 1473 
SEQ ID NO:723 and SEQ ID NO: 1474 
SEQ ID NO:724 and SEQ ID NO: 1475 
SEQ ID NO:725 and SEQ ID NO: 1476 
SEQ ID NO:726 and SEQ ID NO: 1477 
SEQ ID NO:727 and SEQ ID NO: 1478 
SEQ ID NO:729 and SEQ ID NO: 1479 
SEQ ID NO:730 and SEQ ID NO: 1480 
SEQ ID NO:731 and SEQ ID NO: 1481 
SEQ ID NO:732 and SEQ ID NO:1482 
SEQ ID NO:733 and SEQ ID NO: 1483 
SEQ ID NO:734 and SEQ ID NO: 1484 
SEQ ID NO:735 and SEQ ID NO: 1485 
SEQ ID NO:736 and SEQ ID NO: 1486 
SEQ ID NO:737 and SEQ ID NO: 1487 
SEQ ID NO:738 and SEQ ID NO: 1488 
SEQ ID NO:739 and SEQ ID NO: 1489 
SEQ ID NO:740 and SEQ ID NO: 1490 
SEQ ID NO:741 and SEQ ID NO: 1491 
SEQ ID NO:745 and SEQ ID NO: 1492 
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SEQ ID NO:746 and 
SEQ ID NO:747 and 
SEQ ID NO:748 and 
SEQ ID NO:749 and 
SEQ ID NO: 750 and 
SEQ IDNO:751 and 
SEQ ID NO:752 and 
SEQ ID NO:753 and 
SEQ ID NO:754 and 
SEQ ID NO:755 and 
SEQ ID NO:756 and 
SEQ ID NO:757 and 
SEQ ID NO:758 and 
SEQ ID NO:759 and 
SEQ ID NO:760 and 
SEQ ID NO:761 and 
SEQ ID NO:762 and 
SEQ ID NO:763 and 
SEQ ID NO:764 and 
SEQ ID NO:765 and 
SEQ ID NO:766 and 
SEQ ID NO:767 and 
SEQ ID NO:768 and 
SEQ ID NO: 769 and 
SEQ ID NO:770 and 
SEQ ID NO:771 and 
SEQ ID NO:772 and 
SEQ ID NO:773 and 
SEQ ID NO:774 and 
SEQ ID NO: 775 and 
SEQ ID NO:776 and 
SEQ ID NO:777 and 
SEQ ID NO:778 and 
SEQ ID N0.779 and 
SEQ ID NO:780 and 
SEQ ID NO:78 1 and 
SEQ ID NO:782 and 
SEQ ID NO:783 and 
SEQIDNO:784 and 
SEQ ID NO:785 and 
SEQ ID NO:786 and 
SEQ ID NO:787 and 
SEQ ID NO:788 and 
SEQ ID NO:789 and 
SEQ ID NO:790 and 
SEQIDNO:791 and 
SEQ ID NO:792 and 
SEQ ID NO:793 and 
SEQ ID NO:794 and 
SEQ ID NO:795 and 
SEQ ID NO:797 and 
SEQ ID NO:798 and 



SEQ ID NO: 1493 
SEQ ID NO: 1494 
SEQ ID NO: 1495 
SEQ ID NO: 1496 
SEQ ID NO: 1497 
SEQ ID NO: 1498 
SEQ ID NO: 1499 
SEQ ID NO: 1500 
SEQ ID NO: 1501 
SEQ ID NO: 1502 
SEQ ID NO: 1503 
SEQ ID NO: 1504 
SEQ ID NO: 1505 
SEQ ID NO: 1506 
SEQ ID NO: 1507 
SEQ ID NO: 1508 
SEQ ID NO: 1509 
SEQ IDNO:1510 
SEQIDNO:1511 
SEQ ID NO: 15 12 
SEQ IDNO:1513 
SEQ ID NO:1514 
SEQ ID NO: 15 15 
SEQ ID NO: 15 16 
SEQ ID NO: 15 17 
SEQ ID NO: 15 18 
SEQ ID NO: 1519 
SEQ ID NO: 1520 
SEQ ID NO: 1521 
SEQ ID NO: 1522 
SEQ ID NO: 1523 
SEQ ID NO: 1524 
SEQ ID NO: 1525 
SEQ ID NO: 1526 
SEQ ID NO: 1527 
SEQ ID NO: 1528 
SEQ ID NO: 1529 
SEQ ID NO: 1530 
SEQ ID NO: 1531 
SEQ ID NO: 1532 
SEQ ID NO: 1533 
SEQ ID NO: 1534 
SEQ ID NO: 1535 
SEQ ID NO: 1536 
SEQ ID NO: 1537 
SEQ ID NO: 1538 
SEQ ID NO: 1539 
SEQ ID NO: 1 540 
SEQ ID NO: 1541 
SEQ ID NO: 1542 
SEQ ID NO: 1543 
SEQ ID NO: 1544 
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SEQ ID NO:799 and SEQ ID NO: 1 545 
SEQ ID NO:800 and SEQ ID NO: 1 546 
SEQ ID NO:801 and SEQ ID NO: 1547 
SEQ ID NO:802 and SEQ ID NO: 1 548 
SEQ ID NO:803 and SEQ ID NO: 1549 
SEQ ID NO: 804 and SEQ ID NO: 1550 
SEQ ID NO:805 and SEQ ID NO: 155 1 
SEQ ID NO: 806 and SEQ ID NO: 1552 
SEQ ID NO:807 and SEQ ID NO: 1 553 
SEQ ID NO: 809 and SEQ ID NO: 1554 
SEQ ID NO:810 and SEQ ID NO: 1555 
SEQ ID NO:8 1 1 and SEQ ID NO: 1556 
SEQ ID NO:812 and SEQ ID NO: 1557 
SEQ ID NO:813 and SEQ ID NO: 1558 
SEQ ID NO:814 and SEQ ID NO:1559 
SEQ ID N0.816 and SEQ ID NO: 1560 
SEQ ID NO:817 and SEQ ID NO:1561 
SEQ ID NO.-818 and SEQ ID NO:1562 
SEQ ID NO:819 and SEQ ID N0.1563 
SEQ ID NO:820 and SEQ ID NO: 1564 
SEQ ID NO:822 and SEQ ID NO: 1565 
SEQ ID NO:823 and SEQ ID NO: 1566 
SEQ ID NO:824 and SEQ ID NO: 1567 
SEQ ID NO:825 and SEQ ID NO: 1568 
SEQ ID NO:826 and SEQ ID NO: 1569 
SEQ ID NO:827 and SEQ ID NO: 1570 
SEQ ID NO:828 and SEQ ID NO: 1571 
SEQ ID NO.-829 and SEQ ID NO:1572, and 
SEO ID NO:2545 and SEO ID NO:2546 

A polynucleotide which can be synthesized with the primer set of claim 2 or 3. 

A polynucleotide comprising a coding region in the polynucleotide of claim 4. 

A substantially pure protein encoded by polynucleotide of claim 4. 

A partial peptide of the protein of claim 6. 

An isolated polynucleotide selected from the group consisting of 

e2« P°'>T c,eo,ide comprising a coding region of the nucleotide sequence set forth 
oLQ ID NOs: 
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SEQ ID NO:2547, SEQ ID NO.nnnn , SEQ ID NO:2551, SEQ ID NO:2553, SEQ ID NO:2555, 
SEQ ID NO-.2557, SEQ ID NO:2558, SEQ ID NO:2560, SEQ ID NO:2562, SEQ ID NO.2564, 
SEQ ID NO.2566, SEQ ID NO:2568, SEQ ID NO:2570, SEQ ID NO:2572, SEQ ID NO:2574, 
SEQ ID NO:2576, SEQ ID NO:2578, SEQ ID NO.2580, SEQ ID NO:2581, SEQ ID NO:2583, 
SEQ ID NO:2584, SEQ ID NO:nnnn , SEQ ID NO:2588, SEQ ID NO:2590, SEQ ID NO:2592, 
SEQ ID NO:2594, SEQ ID NO:2596, SEQ ID NO:2598, SEQ ID NO:nnnn , SEQ ID NO:2602, 
SEQ ID NO:2604, SEQ ID NO:2606, SEQ ID NO:2607, SEQ ID NO:2609, SEQ ID NO:261 1, 
SEQ ID NO:2613, SEQ ID NO:2615, SEQ ID NO:2617, SEQ IDNO:2619, SEQ ID NO:2621, 
SEQ ID NO-.2623, SEQ ID NO:2625, SEQ ID NO:2627, SEQ ID NO.2629, SEQ ID NO.2630, 
SEQ ID NO:2632, SEQ ID NO:2634, SEQ ID NO:2636, SEQ ID NO:2638, SEQ ID NO.2640, 
SEQ ID NO:2642, SEQ ID NO:2644, SEQ ID NO:2646, SEQ ID NO:2648, SEQ ID NO:2650, 
SEQ ID NO.2652, SEQ ID NO:2654, SEQ ID NO:2656, SEQ ID NO:2658, SEQ ID NO:2660, 
SEQ ID NO:2662, SEQ ID NO.2664, SEQ ID NO:2666, SEQ ID NO:2668, SEQ ID NO:2670, 
SEQ ID NO:2672, SEQ ID NO:2674, SEQ ID NO:2676, SEQ ID NO:2678, SEQ ID NO:2680, 
SEQ ID NO:2681, SEQ ID NO:2683, SEQ ID NO:2685, SEQ ID N0.2686, SEQ ID NO:2688, 
SEQ ID NO.2690, SEQ ID NO:2692, SEQ ID NO.2694, SEQ ID NO.2696, SEQ ID NO:2698, 
SEQ ID NO:2700, SEQ ID NO:2702, SEQ ID NO:2704, SEQ ID NO:2706, SEQ ID NO:2708, 
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SEQ ID NO:2710, SEQ ID NO:2712, SEQ ID NO:2714, SEQ IDNO:2715, SEQ ID N0 2717 
SEQ ID NO:2718, SEQ ID NO:2719, SEQ ID NO:272I, SEQ ID N02722 SEO ID KO-2TJA 
SEQ ID NO:2726, SEQ ID NO:2723, SEQ ID NO:2730, SEQ ID NO^ SEQ ID NO-2734' 
SEQ ID NO:2735, SEQ ID NO:2737, SEQ ID NO:2739, SEQ ID NOmnnn , SEQ ID NO 2743 
SEQ ID NO:2745, SEQ ID NO:2747. SEQ ID NO:2749, SEQ ID NO:2750 SEO ID NO 2752 ' 

33 S m? 2754 ' f Q 10 NO:2?56 ' SEQ m NO:2758 ' S£ Q 10 N0 ™> SEQ m NO-2 16 ' 
™ N0:nDDn 1 SEQ 10 NO:2765 > SE Q E> NO-2766, SEQ ID NO:2768, SEQ ID NO 2770 
SEQ ID NO:2772, SEQ ID NO.-2774, SEQ ID NO:2776, SEQ ID NO 2778 SEO ID NO2780 
SEQ ID NO:2782, SEQ ID NO:2784, SEQ ID N0378* SEQ ID NoS SEQ £ NO2790 
SEQ ID NO:2792, SEQ ID NO:2794, SEQ ID N0.2796, SEQ ID N0 2798 SEO ID NO2800 
SEQ m NO:2802, SEQ ID NO:2804, SEQ ID NO.2806 SEQ ID NoS SEQ £ SSSo 
SEQ ID NO-2812, SEQ ID NO:2814, SEQ ID NO:2816, SEQ ID NO:2818 SEQ m NO2820 
SEQ ID NO:2822, SEQ ID NO:2824, SEQ ID NO:2m, SEQ ID NO:2828 SEQ £ NO2830 
SEQ ID NO:2832, SEQ ID NO:2834, SEQ ID NO:2836, SEQ ID N02838 SEO ID NO : 284o' 
SEQ ID NO:2842, SEQ ID NO:2844. SEQ ID N0.2846 SEQ ID N02848 SEQ ID J£2» 
SEQ ID N02852, SEQ ID NO:2854, SEQ ID NO:2856 SEQ ID NO.-2858 SEQ ID NoS 
SEQ ID NO:2862, SEQ ID NO:2864, SEQ ID NO:2866, SEQ ID N0 2868 SEQ ID NO2870 
SEQ ID NO:2872, SEQ ID NO:2874, SEQ ID NO:2876 SEQ ID NO 2878, SEQ ID NO 2880 

S8 S tKJS' S ID N0 28W ' SEQ 10 N0 2886 ' SE Q ID Noi SEQ ID NO: 890 
SEQ ID NO-2892, SEQ ID NO:2894, SEQ ID NO:2896, SEQ ID NO:2898 SEO ID NO2900 
SEQ ID NO.2902, SEQ ID NO2904, SEQ ID NO2906. SEQ ID NO 2908 SEQ ID N029W 
SEQ ID NO:2912, SEQ ID NO.-2914, SEQ ID NO:2916. SEQ ID N0-2918 SEQ ID NO2920 
SEQ ID NO:2922, SEQ ID NO:2924, SEQ ID NO:2926 SEQ ID NO-2928 SEQ ID N^S' 
SEQ ID NO:2932, SEQ ID NO:2934, SEQ ID NO:2936 SEQ ID N0.2938 SEQ £ NoS 
SEQ ID NO:2942, SEQ ID NO:2944, SEQ ID NO:2946, SEQ ID NO:2948 SEQ ID NoHTo 
SEQ ID NO:2952, SEQ ID NO:2954, SEQ ID NO:2956, SEQ ID NO : 2958,' SEQ £ NO2960 
SEQ m N0.2962, SEQ ID NO.nnna , SEQ ID NO:2966, SEQ ID NO:2967 , SEQ ID N02969 
SEQ ID NO:2971, SEQ ID NO:2973, SEQ ID NO:2975, SEQ ID NO 2977 SEO ID NO-2979 
SEQ ID NO:2981, SEQ ID NO:2983, SEQ ID NO:2985 SEQ ID N02987 SEQ £ NoS' 
SEQ ID NO:2991, SEQ ID NO:2993, SEQ ID NO:2995 SEQ ID NO:2997 SEQ £ N02W 
SEQ ID NO:3001, SEQ ID NO:3003, SEQ ID NO3005 SEQ ID NO:3007' SEQ £ NO3009* 
SEQ ID NO:301 1, SEQ ID NO:3013, SEQ ID NO3015 SEQ ID N03M7 SEQ ID NO30^' 
SEQ ID NO:3021, SEQ ID NO:3023 ( SEQ ID NO:3025 SEQ ID NO-3027 SEO ID NoS' 

£ S :3 ° 31, SEQ 10 NO:3 ° 33 ' SE Q "> NO:3035, SEQ ID NO^?! SEQ ID NO3039 
SEQ ID NO:nnnn , SEQ ID NOrnnnn , SEQ ID NO:3045, SEQ ID NO.3047 SEO ID NO304Q 
SEQ ID NO:3051, SEQ ID NO:3053, SEQ ID NO:3055, SEQ ID NO:3057 SEO ID NO3059 
SEQ ID NO:3061, SEQ ID NO:3063, SEQ ID NO:3065 SEQ ID NO3067 SEQ £ NO'S 
SEQ ID N0:3071, SEQ ID NO.3073, SEQ ID NO:3075, SEQ ID NO3077 SEQ ID NO 3079* 
SEQ ID NO.-3081, SEQ ID NO:3083, SEQ ID NO:3085 SEQ ID NO 3087 SEQ £ NO-S 
SEQ ID NO:3091, SEQ ID NO:3093, SEQ ID NO3095, SEQ ID NO:3097 SEQ ID 
SEQ ID NO:3101, SEQ ID NO:3103, SEQ ID NO:3105, SEQ ID NO:3107 SEQ ID No'S 
SEQ ID NO:31 1 1, SEQ ID NO:31 13, SEQ ID NO:3115 SEQ ID N03 17 SEQ ID 
SEQ ID NO:3121, SEQ ID NO:3123, SEQ ID NO:3125 SEQ ID NO 3 27 SEQ £ N03 29' 
SEQ ID NO:3131, SEQ ID NO:3133, SEQ ID NO:3135, SEQ ID N03137 SEQ ID NO : 3 39 
SEQ ID NO:3140, SEQ ID NO:3142, SEQ ID NO:3143 SEQ ID N03144 SEQ ID N0^3 46* 
SEQ ID NO:3148, SEQ ID NO:3150, SEQ ID NO:3.52 SEQ ID NO 3 54* SEQ £ NOO 56 
SEQ ID NO : 3158, SEQ ID NO.nnnn , SEQ ID NO:3162, SEQ ID NO:3.M SEqS NO 3166 
SEQ ID N0.3168, SEQ ID NO.3170, SEQ ID NO:3172, SEQ ID NO:3174 SEO ID NO-3176 ' 
SEQ ID NO:3178, SEQ ID NO:3180, SEQ ID NO:3181, SEQ ID NO:3 m] SEQ D NO 3 85 
SEQ ID NO.3187, SEQ ID NO:3188, SEQ ID NO:3189, SEQ ID N0 3191 SEO ID NO 3 93 
SEQ ID NO:3195, SEQ ID NO:3196, SEQ ID NO:3198 SEQ ID NO 200 SEQ S ^3202 
SEQ ID NO3204, SEQ ID NO:32%. SEQ ID NO:3208, SEQ ID NO3210; SEQ ID NO mi 
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SEQ ID NO:32i4, SEQ ID NO:nnnn , SEQ ID N0:3218, SEQ ID NO:3219, SEQ ID NO:3221, 
SEQ ID NO:3222, SEQ ID NO:3224, SEQ ID NO:3226. SEQ ID NOrnnnn , SEQ ID NO.3230, 
SEQ ID NO:3232, SEQ ID NO:3234, SEQ ID NO:3236, SEQ ID NO:3238, SEQ ID NO:3240, 
SEQ ID NO:3242, SEQ ID NO:3244, SEQ ID NO:3246, SEQ ID NO:nnnn , SEQ ID NO:3250, 
SEQ ID NO:3252, SEQ ID NO:3254, SEQ ID NO:3256, SEQ ID NO:3258, SEQ ID NO:326(X 
SEQ ID NO:3262, SEQ ID NO:3264, SEQ ID NO:3266, SEQ ID NO:3268, SEQ ID NO:3270, 
SEQ ID NO:3272, SEQ ID NO:3274, SEQ ID NO:3276, SEQ ID NO:3278, SEQ ID NO:3280, 
SEQ ID NO:3282, SEQ ID NO:3284, SEQ ID NO:3286, SEQ ID NO:3288, SEQ ID NO:3290, 
SEQ ID NO:3292, SEQ ID NO:3294, SEQ ID NO:3296, SEQ ID NO:3298, SEQ ID NO:3300, 
SEQ ID NO:3301 , SEQ ID NO:3303, SEQ ID NO:3305, SEQ ID NO:3307, SEQ ID NO:3309, 
SEQ ID NO:3311, SEQ ID NO:3312, SEQ ID NO:3314, SEQ IDNO:3316, SEQ ID NO:3318 
SEQ ID NO:3320, SEQ ID NO:3322, SEQ ID NO:3324, SEQ ID NO:3326, SEQ ID NO:3328, 
SEQ ID NO:3330, SEQ ID NO:3332, SEQ ID NO:3334, SEQ ID NO:3336, SEQ ID NO:3338 > 
SEQ ID NO:nnnn , SEQ ID NO:3342, SEQ ID NO:3344, SEQ ID NO:3346, SEQ ID NO:3348, 
SEQ ID NO:3350, SEQ ID NO:3352, SEQ ID NO:3354, SEQ ID NO:3356, SEQ ID NO:3358, 
SEQ ED NO:3360, SEQ ID NO:3362, SEQ ID NO:3364, SEQ ID NO:3366, SEQ ID NO:3368, 
SEQ ID NO:3370, SEQ ID NO:3372, SEQ ID NO:3374, SEQ ID NO:3376, SEQ ID NO:3378, 
SEQ ID NO:3380, SEQ ED NO:3382, SEQ ID NO:3384, SEQ ED NO:3386, SEQ ID NO:3388, 
SEQ ID NO:3390, SEQ ID NO:3392, SEQ ID NO:3394, SEQ ID NO:3396 t SEQ ID NO:3398, 
SEQ ID NO:3400, SEQ ID NO:3402, SEQ ID NO:3404, SEQ ED NOinnnn , SEQ ID NO:3408, 
SEQ ID NO:3410, SEQ ID N0.34! 1, SEQ ED NO:3413, SEQ ID NO:3415, SEQ ID NO:3417, 
SEQ ID NO:3419, SEQ ID NO:nnnn , SEQ ID NO:3423, SEQ ID NO:3425, SEQ ID NO:3427, 
SEQ ID NO:3429, SEQ ID NO:3431, SEQ ID NO:3433, SEQ ID NO:3435, SEQ ID NO:3437, 
SEQ ID NO:3439, SEQ ID NO:3441, SEQ ID NO:3443, SEQ ID NO:3445, SEQ ID NO:3447, 
SEQ ID NO:3449, SEQ ID NO:3451, SEQ ID NO:3453, SEQ ID NO:3455, SEQ ID NO:3457> 
SEQ ID NO:3459, SEQ ID NO:3461, SEQ ID NO:3463, SEQ ID NO:3465, SEQ ID NO:3467, 
SEQ ID NO:3469, SEQ ID NO:3471, SEQ ID NO:3473, SEQ ID NO:3475, SEQ ID NO:3477, 
SEQ ID NO:3479 t SEQ ID NO.3481, SEQ ID NO:3482, SEQ ID NO:3484, SEQ ID NO:3486, 
SEQ ID NO:3488, SEQ ID NO.3490, SEQ ID NO:3492, SEQ ID NO:3494, SEQ ID NO:3496, 
SEQ ID NO:3498, SEQ ID NO:3500, SEQ ID NO:3502 t SEQ ID NO:3504, SEQ ID NO:3506, 
SEQ IDNO:3508, SEQ ID NO:3509, SEQ ID NO:351 1, SEQ ID NO:3513, SEQ ID NO:3515 
SEQ ID NO:3517, SEQ ID NO:3519, SEQ ID NO:3521, SEQ ID NO:3523, SEQ ID NO:3525, 
SEQ ID NO:3527, SEQ ID NO:3529, SEQ ID NO:3531, SEQ ED NO:3533, SEQ ID NO:3535, 
SEQ ID NO:3537, SEQ ID NO:3539, SEQ ID NO:3541, SEQ ID NO:3543, SEQ ID NO:3545, 
SEQ ID NO:3547, SEQ ID NO:3549, SEQ ID NO:3551, SEQ 3D NO:3553, SEQ ID NO: 3555, 
SEQ ID NO:3557, SEQ ID NO:3559, SEQ ID NO:3561, SEQ ID NO:3563, SEQ ID NO:nnnn , 
SEQ ID NO:3567, SEQ ID NO:3569, SEQ ID NO:3571, SEQ ID NO:3573, SEQ ID NO:3575, 
SEQ ID NO:3577, SEQ ID NO:3579, SEQ ID NO:3581, SEQ ID NO:3583, SEQ ID NO:3585> 
SEQ ID NO:3587, SEQ ID NO:3589 , SEQ ID NO:3595, SEQ ID NO:3597, SEQ ID NO:3599, 
SEQ ID NO:3601, SEQ ID NO:36C3, SEQ ID NO:3605, SEQ ID NO:3607, SEQ ID NO:3609, 
SEQ ID NO:3610, SEQ ID NO:361 1, SEQ ID NO:3612, SEQ ID NO:3613, SEQ ID NO:3615, 
SEQ ID NO:3617, SEQ ID NO:3619 y SEQ ID NO:3621, SEQ ID NO:3622, SEQ ID NO:3624. 
SEQ ID NO:3626, SEQ ID NO:3627, SEQ ID NO:3628, SEQ ID NO:3629, SEQ ID NO:3631 , 
SEQ ID NO:3633, SEQ ID NO:3634, SEQ ID NO:3636, SEQ ID NO:3638, SEQ ID NO:3640, 
SEQ ID NO:3642, SEQ ID NO:3644, SEQ ID NO:3645, SEQ ID NO:3647, SEQ ID NO:3649, 
SEQ ID NO:3651, SEQ ID NO:3653, SEQ ID NO:3655, SEQ ID NO:3657, SEQ ID NO:3658, 
SEQ ID NO:3660, SEQ ID NO:3662, SEQ ID NO:3664, SEQ ID NO:3666, SEQ ID NO:3668, 
SEQ ID NO:3670, SEQ ID NO:3672, SEQ ID NO:3674, SEQ ID NO:3676, SEQ ID NO:3678, 
SEQ ID NO:3680, SEQ ID NO:3682, SEQ ID NO:3683, SEQ ID NO:3685, SEQ ID NO:3686, 
SEQ ID NO:3688, SEQ ID NO:3690, SEQ ID NO:3692, SEQ ID NO:3694, SEQ ID NO:3696, 
SEQ ID NO:3698, SEQ ID NO.3700, SEQ ID NO:3702, SEQ ID NO:3704, SEQ ID NO:3706, 
SEQ ID NO:3708, SEQ ID NO:37].0 > SEQ ID NO:3712, SEQ ID NO:3714, SEQ ID NO:3716, 
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SEQ ID NO:3718, SEQ ID NO.3720, SEQ ID NO:3722, SEQ ID NO:3724, SEQ ID N03726 
SEQ ID NO:3728, SEQ ID NO:3730, SEQ ID N03732, SEQ ID NO:3734, SEQ ID N03736 
SEQ ID NO:3738, SEQ ID NO:3740, SEQ ID NO:3741, SEQ ID NO:3743, SEQ ID NO : 3745' 
SEQ ID NO.-3747, SEQ ID NO:3749, SEQ ID NO:3751, SEQ ID NO:3753, SEQ ID N03755' 
SEQ ID NO:3756, SEQ ID N0.3758, SEQ ID NO:3760, SEQ ID NO:3761 SEQ ID N0 3763' 
SEQ ID NO.-3765, SEQ ID NO.3767, SEQ ID NO:3769, SEQ ID NO:3771,' SEQ ID N03773' 
SEQ ID NO:3775, SEQ ID NO:3777 

SEQ ID NO:3779, SEQ ID NO:3781, SEQ ID NO:3783, SEQ ID NO:3785, SEQ ID N0 3787 
SEQ ID NO:3789, SEQ ID NO:3791 , SEQ ID NO:3793, SEQ ID NO:3795 SEQ ID NO-3797' 
SEQ ID NO.3799, SEQ ID NO:3801, SEQ ID NO:3803, SEQ ID NO:3805! SEQ ID NoW 
SEQ ID NO.-3808, SEQ ID NO:3810, SEQ ID NO:3812. SEQ ID NO:3814 SEQ ID NO-3816' 
SEQ ID NO:38l8, SEQ ID NO:3820, SEQ ID NO:3822, SEQ ID NO:3824, SEQ ID N03826 
SEQ ID N0.3828, SEQ ID NO.3830, SEQ ID NO:3832, SEQ ID NO:3834, SEQ ID NO : 3836 
SEQ ID NO:3838, SEQ ID NO:3840, SEQ ID NO:3842, SEQ ID NO:3844, SEQ ID NO-3846 
SEQ ID NO:3848, SEQ ID NO:3852, SEQ ID NO:3854, SEQ ID NO:3856, SEQ ID N03858 
SEQ ID NO:3860, SEQ ID NO:3862, SEQ ID NO:3864, SEQ ID NO:3866, SEQ ID N03868' 
SEQ ID NO:3870, SEQ ID NO:3872, SEQ ID NO:3874, SEQ ID NO:3876, SEQ ID N03878' 
SEQ ID NO:3880, SEQ ID NO:3882, SEQ ID NO:3884, SEQ ID NO:3886, SEQ ID N03888' 
SEQ ID NO:3890, SEQ ID NO:3892, SEQ ID NO:3894, SEQ ID NO:3896, SEQ ID NO : 389s' 
SEQ ID NO:3900, SEQ ID NO:3902, SEQ ID NO.3904, SEQ ID NO:3906 SEQ ID NO 3908' 
SEQ ID NO:3910, SEQ ID NO:3912, SEQ ID NO:3914, SEQ ID NO:3916, SEQ ID NO : 3918' 
SEQ ID NO:3920, SEQ ID NO:3922, SEQ ID NO:3924, SEQ ID NO:3926, SEQ ID NO^s' 
SEQ ID NO:3930, SEQ ID NO:393 2, SEQ ID NO:3934, SEQ ID NO:3936 SEQ ID NO : 3938' 
SEQ ID NO:3940, SEQ ID NO:3942, SEQ ID NO:3944, SEQ ID NO:3946,' SEQ ID NO : 3948' 
SEQ ID NO:3950, SEQ ID NO:3952, SEQ ID NO:3954, SEQ ID NO:3956 SEQ ID NO : 3958* 
SEQ ID NO-.3960, SEQ ID NO:3962, SEQ ID NO:3964, SEQ ID NO:3965' SEQ ID NO" 3966' 
SEQ ID NO:3968, SEQ ID NO:3970, SEQ ID NO.3972, SEQ ID NO:3974, SEQ ID NoW 
SEQ ID NO:3977, SEQ ID NO:3979, SEQ ID NO:3981, SEQ ID NO:3983, SEQ ID N03985 
ofS S NO:3987 ' SE< 3 m NO:3988, SEQ ID NO:3990, SEQ ID NO:3992, SEQ ID Na3994 
SEQ ID NO:3996, SEQ ID NO:3998, SEQ ID NO:4000, SEQ ID NO:4001, SEQ ID NO4003' 
SEQ ID NO.-4004, SEQ ID NO:4006, SEQ ID NO:4008, SEQ ID NO:4010 SEQ ID NO : 4012' 
SEQ ID NO:4016, SEQ ID NO:4018, SEQ ID NO:4020, SEQ ID NO.-4022' SEQ ID NO4024 
SEQ ID NO:4026, SEQ ID NO:4028, SEQ ID NO:4030, SEQ ID NO:4032' SEQ ID NO-4034 
SEQ ID NO:4036, SEQ ID NO:4038, SEQ ID NO.-4040, SEQ ID NO:4042' SEQ ID NO : 4044' 
SEQ ID NO:4046, SEQ ID NO:4048, SEQ ID NO:4050, SEQ ID NO:4052 SEQ ID NO4054 
SEQ ID NO:4056, SEQ ID NO:4058, SEQ ID NO:4060, SEQ ID NO:4062' SEQ ID NO : 4064 
SEQ ID NO:4066, SEQ ID NO:4068, SEQ ID NO:4070, SEQ ID NO:4072' SEQ ID NO4074 
SEOIDNO:4076 V «w.w/i. 



SEQ ID NO:4078, SEQ ID NO:4080, SEQ ID NO:4082, SEQ ID NO:4084, SEQ ID NO 4086 
SEQ ID NO:4088, SEQ ID NO:4090, SEQ ID NO:4092, SEQ ID NO:4094 SEQ ID NO ! 4096' 
SEQ ID NO:4098, SEQ ID NO:4099, SEQ ID Ntt4101, SEQ ID NO:4103' SEQ ID NO : 4105' 
SEQ ID NO:4 107, SEQ ID NO:4109, SEQ ID NO:41 1 1, SEQ ID NO:41 13* SEQ ID NO^l 15 
SEQ ID NO:41 17, SEQ ID NO:41 19, SEQ ID NO:4121 , SEQ ID NO:4123 SEQ ID N04125' 
SEQ ID NO:4127, SEQ ID NO:4129, SEQ ID NO:4131, SEQ ID NO:4132, SEQ ID NO-4134 
SEQ ID N0:4136, SEQ ID NO:4137, SEQ ID NO:4139, SEQ ID NO:4140 SEQ ID NO-4142 
SEQ ID N0:4144, SEQ ID NO:4146, SEQ ID NO:4148, SEQ ID NO:415o' SEO ID N04152 
SEQ ID NO:4154, SEQ ID NO:4155, SEQ ID NO:4157, SEQ ID NO:4159' SEO ID NO : 416l' 
SEQIDNO:4163,SEQIDNO:4165 ' ' 

SEQ ID NO:4167, SEQ ID NO:4168, SEQ ID NO:4170, SEQ ID NO:4172 SEO ID NO 4174 
SEQIDNO:4176,andSEQIDNO:4178 ' Ul ™* l <«> 

(b) a polynucleotide comprising a nucleotide sequence encoding a protein comprising the amino acid sequence 
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set forth in any one of the following SEQ ID NOs: 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 
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SEQ ID NO:2548, SEQ ID NO.nnnn , SEQ ID NO:2552, SEQ ID N0.2554 SEQ ID N02556 
SEQ ID NO:2559, SEQ ID NO:2561, SEQ ID NO:2563, SEQ ED NO:2565, SEQ ID NO-2567 ' 
SEQ ID N0.2569, SEQ ID N0.2571, SEQ ID NO:2573, SEQ ID NO:2575 SEQ ID N02577 
SEQ ID NO:2579, SEQ ID NO:2582, SEQ ID NO:2585, SEQ ID NO:nnnn SEQ ID NO 2589 

2£ £ M? 2591 ' SEQ ID NO:2593 > SEQ m NO:2595 ' SE Q NoS.SEQ ID NO-25W 
SEQ ID NO:nnnn , SEQ ID NO:2603, SEQ ID NO:2605, SEQ ID NO:2608, SEQ ID NO-2610 
SEQ ID NO:2612, SEQ ID NO:2614, SEQ ID NO:2616, SEQ ID NO:2618 SEQ ID NO 2620 
SEQ ID NO:2622, SEQ ID NO:2624, SEQ ID NO:2626, SEQ ID N0.2628, SEQ ID NO : 2631 ' 
SEQ ID NO:2633, SEQ ID NO:2635, SEQ ID NO:2637, SEQ ID NO:2639 SEQ £ NoX 
SEQ ID NO.2643, SEQ ID N0.2645, SEQ ID NO.2647, SEQ ID NO:2649 SEQ ID NO 2S ' 
SEQ ID NO:2653, SEQ ID NO:2655, SEQ ID NQ2657 SEQ ID NO 2659 SEQ ID NO 266 ' 
SEQ ID NO:2663, SEQ ID NO:2665, SEQ ID NO:2667, SEQ ID NO:2669 SEQ ID N0267]' 
SEQ ID NO:2673, SEQ ID NO:2675, SEQ ID NO:2677, SEQ ID N0 2679 SEQ £ N^2682 
SEQ ID NO:2684, SEQ ID NO:2687, SEQ ID NO:2689, SEQ ID NO:2691 SEQ ID N02693 
SEQ ffi NO.2695, SEQ ID NO:2697, SEQ ID NO.2699 SEQ ID NO:270 \] SEQ £ NoS 
SEQffi NO:2705, SEQ ID NO2707, SEQ IDNO2709, SEQ ID NO:271 1 SEQ ID NO 27 13 

SEO ID ™2£ S In ISS* f Q » N ° :2723 ' ^ 10 NO:2725 ' ® * S 
SEQ IDNO:2729, SEQ ID NO:2731, SEQ ID NO:2733, SEQ ID N0 2736 SEO ID NO-2738 

SEQ D NO2740, SEQ ID NO:nnnn . SEQ ID NO:2744, SEQ ID NO:2746, SEQ ID NO 2748 

SEO ID N02 2 762' S £ SK* N ° :2?55 ' $EQ ID ^^SE^nSSS' 

SEQ ID NO.2762, SEQ ID NO.nnnn , SEQ ID NO:2767, SEQ ID NO:2769 SEO ID NO-2771 

SEQ ID NO:2773, SEQ ID NO:2775 t SEQ ID NO:2777, SEQ ID N0 2779 SEO ID NO-2^81 

SEQ ID NO:2783, SEQ ID NO:2785, SEQ ID NO:2787 SEQ ID NO 2789 SEQ ID N^279 " 

SEQ m NO:2793, SEQ ID NO:2795, SEQ ID NO:2797, SEQ ID NO:2799 SEQ £ NO280 ' 

SEQ ID NO:2803, SEQ ID NO:2805, SEQ ID NO:2807, SEQ ID NO:2809 SEQ ID N028U 

SEQ © NO:2813, SEQ ID N02815. SEQ ID NO:2817, SEQ ID NO^S SEQ ID N0282 * 

SEQ ID NO:2823, SEQ ID NO:2825, SEQ ID NO:2827, SEQ ID NO:2829 SEO ID NO 283 ' 

SEQ ID N02833, SEQ ID NO:2835, SEQ ID N0.2837 SEQ ID NO 2839 SEQ £ NO 284 ' 

SEQ ID NO:2843. SEQ ID NO:2845, SEQ ID NO:2847 SEQ ID NO 2849 SEQ £ N0285 ' 

SEQ ID NO:2853, SEQ ID NO:2855, SEQ ID NO:2857 SEQ ID N02859 SEO £ N0286 ' 

SEQ ID NO:2863, SEQ ID NO:286:5, SEQ ID NO:2867 SEQ ID NO:2869 SEQ £ mwi ' 

SEQ ID NO:2873, SEQ ID NO:2875, SEQ ID NO:2877 SEQ ID N0 2879 SEO £ NO 2 88 ' 

SEQ ID NO:2883, SEQ ID NO:2885, SEQ ID NO:2887 SEQ ID N02889 SEQ £ N0289 ' 

SEQ ID NO:2893, SEQ ID NO:2895, SEQ ID NO:2897, SEQ ID NO:2899 SEQ ID N02W ' 

SEQ ID NO2903, SEQ ID NO:2905, SEQ ID NO:2907, SEQ ID NO^! SEQ ID NO^ ' 

SEQ ID NO:2913, SEQ ID NO:2915, SEQ ID NO:2917, SEQ ID NO:29I9 SEQ £ N0292 ' 

SEQ ID NO:2923, SEQ ID NO:2925, SEQ ID NO:2927 SEQ ID NO:2929 SEQ £ NO 293 ' 

SEQ ID NO:2933, SEQ ID NO:2935, SEQ ID NO:2937, SEQ ID NO:2939 SEQ £ N0294 ' 

SEQ ID NO:2943, SEQ ID NO:2945, SEQ ID NO:2947 SEQ ID NO 2949 SEO £ NO 295 ' 

SEQ ID NO:2953, SEQ ID NO:2955, SEQ ID NO:2957 SEQ ID NO 2959 SEQ £ S' 

SEQ ID NO:2963. SEQ ID NO.nnnn , SEQ ID NO:2968 SEQ ID NO2970. SEQ ID NO 2972 

SEQ ID NO:2974, SEQ ID NO:2976, SEQ ID NO:2978. SEQ ID NO2980 SEQ ID N02982 

SEQ ID NO:2984, SEQ ID NO:2986, SEQ ID NO:2988, SEQ ID NO2990 SEQ £ N02992* 

SEQ ID N0:2994, SEQ ID NO:2996, SEQ ID NO:2998, SEQ ID NO-300o' SEO £ NO3002 

SEQ ID NO.-3004, SEQ ID NO:3006, SEQ ID NO:3008 SEQ ID nS SEQ £ J£5£ 

SEQ ID NO3014, SEQ ID NO:3016, SEQ ID NO3018, SEQ ID NO 3020 SEO £ NO-3022 

SEQ ID NO.-3024, SEQ ID NO.3026, SEQ ID NO:3028 SEQ ID NO: 3030 SEQ £ m ml 

«S £ S' 3034 ' ?^^ 0:3036 ' SEQ ID NO:3038 ' SE Q ID N O:3040 SEQ £ No! 1' 
SEQ ID NO:nnnn , SEQ ID NO:30 t 6, SEQ ID NO.3048, SEQ ID NO:3050, SEQ ID NO 3052' 

SEQ ID NO:3054, SEQ ID NO:3056, SEQ ID NO:3058, SEQ ID NO 3060 SEO ID NO3062 
SEQ ID NO:3064, SEQ ID NO3066, SEQ ID NO^! SEQ ID K^Z] SEQ £ SS» 
SEQ ID NO:3074, SEQ ID NO:3076. SEQ ID NO:3078, SEQ ID NO3080 SEQ ID NO3082 



1374 



EP 1 130 094 A2 



SEQ ID NO:3084, SEQ ID NO:3086, SEQ ID NO:3088, SEQ ID NO:3090, SEQ ID NO:3092, 
SEQ ID NO:3094, SEQ ID NO:3096, SEQ ID NO:3098, SEQ ID NO:3 100, SEQ ID NO:3102, 
SEQ ID NO:3 104, SEQ ID NO:3 106, SEQ ID NO:3108, SEQ ID NO:3 1 10, SEQ ID NO:31 12, 
SEQ ID NO:3 1 14, SEQ ID NO:3 1 1 6, SEQ ID NO:3 1 1 8, SEQ ID NO:3 1 20, SEQ ID NO:3 1 22, 
SEQ ID NO:3124, SEQ ID NO:3126, SEQ ID NO:3128, SEQ ID NO:3130, SEQ ID NO:3132, 
SEQ ID NO:3134, SEQ ID NO:3136, SEQ ID NO:3138, SEQ ID NO:3141, SEQ ID NO:3145, 
SEQ ID NO:3147, SEQ ID NO:3149, SEQ ID NO:3151, SEQ ID NO:3153, SEQ ID NO:3155, 
SEQ IDNO:3157, SEQ ID NO:3l59, SEQ ID NO:nnnn , SEQ ID NO:3163, SEQ ID NO:3165 
SEQIDNO:3167, SEQ ID NO:3169, SEQ EDNO:3171, SEQ ID NO:3173, SEQ ID NO:3175, 
SEQ ID NO:3177, SEQ ID NO:3179, SEQ ID NO:3182, SEQ ID NO:3184, SEQ ID NO:3186, 
SEQ TO NO:3190, SEQ ID NO:3192, SEQ ID NO:3194, SEQ ID NO:3197, SEQ ID NO:3199, 
SEQ ID NO:3201, SEQ ID NO:3203, SEQ ID NO:3205, SEQ ID NO:3207, SEQ ID NO:3209, 
SEQ ED NO:321 1, SEQ ID NO:3213, SEQ ID NO:3215, SEQ ID NO:nnnn , SEQ ID NO:3220, 
SEQ ID NO:3223, SEQ ID NO:3225, SEQ ID NO:3227, SEQ ID NO:nnnn , SEQ ID NO:3231, 
SEQ ID NO:3233, SEQ ID NO:3235, SEQ ID NO:3237, SEQ ID NO:3239, SEQ ID NO:3241, 
SEQ ID NO:3243, SEQ ID NO:3245 f SEQ ID NO:3247, SEQ ID NO:nnnn , SEQ ID NO:325L 
SEQ ID NO:3253, SEQ ID NO:3255, SEQ ID NO:3257, SEQ ID NO:3259, SEQ ID NO:3261, 
SEQ ID NO:3263, SEQ ID NO:3263, SEQ ID NO:3267, SEQ ID NO:3269, SEQ ID NO:3271, 
SEQ ID NO:3273 T SEQ ID NO:327!>, SEQ ED NO:3277, SEQ ID NO:3279, SEQ ID NO:3281, 
SEQ ID NO:3283, SEQ ID NO:3285, SEQ ID NO:3287, SEQ ID NO:3289, SEQ ID NO:3291, 
SEQ ID NO:3293, SEQ ID NO:3295, SEQ ID NO:3297, SEQ ID NO:3299, SEQ ID NO:3302, 
SEQ ID NO:3304, SEQ ID NO:330<5, SEQ IDNO:3308, SEQ ID NO:3310, SEQ ID NO:3313, 
SEQ ID NO:3315, SEQ ID NO:3317, SEQ ID NO:3319, SEQ ID NO:3321, SEQ ID NO:3323, 
SEQ ID NO:3325, SEQ ED NO:3327, SEQ ID NO:3329, SEQ ID NO:333 1 , SEQ ID NO:3333, 
SEQ ID NO:3335, SEQ ID NO:3337, SEQ ID NO:3339, SEQ ID NO:nmin , SEQ ID NO:3343 5 
SEQ ID NO:3345, SEQ ID NO:3347, SEQ ID NO:3349, SEQ ID NO:3351, SEQ ID NO:3353, 
SEQ ID NO:3355, SEQ ID NO:3357, SEQ ID NO:3359, SEQ ID NO:3361, SEQ ID NO:3363, 
SEQ ID NO:3365, SEQ ID NO:3367, SEQ ID NO:3369, SEQ ID NO:3371, SEQ ID NO:3373 
SEQ ID NO:3375, SEQ ID NO:3377, SEQ ID NO:3379, SEQ ID NO:3381, SEQ ID NO:3383, 
SEQ ID NO:3385, SEQ ID NO:3387, SEQ ID NO:3389, SEQ ID NO:3391, SEQ ID NO:3393, 
SEQ ID NO:3395, SEQ ID NO:3397, SEQ ID NO:3399, SEQ ID NO:3401, SEQ ID NO:3403, 
SEQ ID NO:3405, SEQ ID NO:nnnn , SEQ ID NO:3409, SEQ ID NO:3412, SEQ ID NO:3414 ? 
SEQ ID NO:3416, SEQ ID NO:3413, SEQ ID NO:3420, SEQ ID NO:nnnn , SEQ ID NO:3424, 
SEQ ID NO:3426, SEQ ID NO:3428, SEQ ID NO:3430, SEQ ID NO:3432, SEQ ID NO:3434, 
SEQ ID NO:3436, SEQ ID NO:3438, SEQ ID NO:3440, SEQ ID NO:3442, SEQ ID NO:3444, 
SEQ ID NO:3446, SEQ ID NO:3448, SEQ ID NO:3450, SEQ ID NO:3452, SEQ ID NO:3454, 
SEQ ID NO:3456, SEQ ID NO:3458, SEQ ID NO:3460, SEQ ID NO:3462, SEQ ID NO:3464, 
SEQ ID NO:3466. SEQ ED NO:3468, SEQ ID NO:3470, SEQ ID NO:3472, SEQ ID NO:3474, 
SEQ ID NO:3476, SEQ ID NO:3478, SEQ ID NO:3480, SEQ ID NO:3483, SEQ ID NO:3485, 
SEQ ID NO:3487, SEQ ID NO:3489, SEQ ID NO:3491, SEQ ID NO:3493, SEQ ID NO:3495, 
SEQ ID NO:3497, SEQ ID NO:3499, SEQ ID NO:3501 , SEQ ID NO.3503, SEQ ID NO:3505, 
SEQ ID NO:3507, SEQ ED NO:3510, SEQ ID NO:3512, SEQ ID NO:3514, SEQ ID NO:3516, 
SEQ ID NO:3518, SEQ ID NO:3520, SEQ ID NO:3522, SEQ ID NO:3524, SEQ ID NO:3526, 
SEQ ID NO:3528, SEQ ID NO:3530, SEQ ID NO:3532, SEQ ID NO:3534, SEQ ID NO:3536, 
SEQ ID NO:3538, SEQ ID NO:3540, SEQ ID NO:3542, SEQ ED NO:3544, SEQ ID NO:3546, 
SEQ ED NO:3548, SEQ ID NO:3550, SEQ ID NO:3552, SEQ ID NO:3554, SEQ ID NO:3556, 
SEQ ED NO:3558, SEQ ID NO:3560, SEQ ID NO:3562, SEQ ID NO:3564, SEQ ID NO:nnnn , 
SEQ ID N0.3568, SEQ ID NO:3570, SEQ ID NO:3572, SEQ ID NO:3574, SEQ ID NO:3576 
SEQ ID NO:3578, SEQ ID NO:3580, SEQ ID NO:3582 ? SEQ ID NO:3584, SEQ ID NO:3586, 
SEQ ID NO:3588, SEQ ID NO:3590 

SEQ ID NO:3596, SEQ ID NO:3598, SEQ ID NO:3600, SEQ ID NO:3602, SEQ ID NO:3604, 
SEQ ID NO:3606, SEQ ID NO:3608, SEQ ID NO:3614, SEQ ID NO:3616, SEQ ID NO:3618, 
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SEQ ID NO:3620, SEQ ID NO:3623, SEQ ID NO:3625, SEQ ID NO:3630, SEQ ID NO-3632 
SEQ ID NO:3635, SEQ ID NO:3637, SEQ ID N0.3639, SEQ ID NO:3641, SEQ ID NO 3643* 
SEQ ID NO:3646, SEQ ID NO:3648, SEQ ID NO:3650, SEQ ID NO:3652 SEQ ID N03654 
SEQ ID NO:3656, SEQ ID NO:3659, SEQ ID NO:3661, SEQ ID NO:3663, SEQ ID NO 3665* 
SEQ ID NO:3667, SEQ ID NO:3669, SEQ ID NO:3671, SEQ ID NO:3673 SEQ ID NO-3675 
SEQ ID N0.3677, SEQ ID NO:3679, SEQ ID NO:3681, SEQ ID N0.3684, SEQ ID N03687' 
SEQ ID NO:3689, SEQ ID NO:3691, SEQ ID N0.3693, SEQ ID N0.3695, SEQ ID NO-3697 
SEQ ID NO:3699, SEQ ID NO:3701, SEQ ID NO:3703, SEQ ID NO:3705 SEQ ID NO-3707 
SEQ ID NO3709, SEQ ID NO:371 1, SEQ ED NO:3713, SEQ ID NO:3715, SEQ ID NO^H 
SEQ ID NO:3719, SEQ ID NO:372 1, SEQ ID NO:3723, SEQ ID NO:3725. SEQ ID NO : 3727' 
SEQ ID NO:3729, SEQ ID NO:373 1, SEQ ID NO:3733, SEQ ID NO:3735 SEQ ID NO-W 
SEQ ID NO:3739, SEQ ID NO:3742, SEQ ID NO:3744, SEQ ID NO:3746 SEQ ID NO : 3748' 
SEQ ID NO:3750, SEQ ID NO.3752, SEQ ID NO.3754, SEQ ID NO:3757' SEQ ID NO : 3759' 
SEQ ID NO:3762, SEQ ID NO:3764, SEQ ID NO:3766, SEQ ID NO:3768, SEQ ID NO-377o' 
SEQ ID NO:3772, SEQ ID NO:3774, SEQ ID NO:3776, SEQ ID NO:3778, SEQ ID NO : 378o' 
SEQ ID NO:3782, SEQ ID NO:3784, SEQ ID NO:3786, SEQ ID NO:3788, SEQ ID N0 379o' 
SEQ ID NO:3792, SEQ ID NO:3794, SEQ ID NO.-3796, SEQ ID NO:3798, SEQ ID NO^SOo' 
SEQ ID NO:3802, SEQ ID NO:3804, SEQ ID NO:3807, SEQ ID NO:3809, SEQ ID NO : 381l' 
SEQ ID NO:3813, SEQ ID NO:38l5, SEQ ID NO:38I7, SEQ ID NO:3819 SEQ ID NO : 3821 
SEQ ID NO:3823, SEQ ID NO:3825, SEQ ID NO:3827, SEQ ID NO:3829 SEQ ID NO-383I' 
SEQ ID NO:3833, SEQ ID NO:3835, SEQ ID NO:3837, SEQ ID NO:3839 SEQ ID NO : 384l' 
SEQ ID NO:3843, SEQ ID NO:3845, SEQ ID NO:3847, SEQ ID NO:3849[ SEQ ID NO : 3853' 
SEQ ID NO:3855, SEQ ID NO:3857, SEQ ID NO:3859, SEQ ID NO:3861, SEQ ID NO : 3863' 
SEQ ID NO:3865, SEQ ID NO:3867, SEQ ID NO:3869, SEQ ID NO:3871, SEQ ID NO-3873' 
SEQ ID NO:3875, SEQ ID NO:3877, SEQ ID NO:3879. SEQ ID NO:3881, SEQ ID N03883* 
SEQ ID N0.3885, SEQ ID NO:3887, SEQ ID NO:3889, SEQ ED NO:3891 SEQ ED NO : 3893' 
SEQ ID NO.3895, SEQ ID NO.3897, SEQ DD NO:3899, SEQ ID NO:3901 ' SEQ ID NO 3903* 
SEQ ID NO:3905. SEQ ID NO:3907, SEQ ID NO:3909, SEQ ID NO.-391 1, SEQ ID NO^B 
SEQ ID NO:3915. SEQ ID NO:3917, SEQ ID NO:3919, SEQ ID N0.3921, SEQ ID NO : 3923 
SEQ ID NO:3925, SEQ ID NO:3927, SEQ ID NO:3929, SEQ ID NO:3931, SEQ ID NO : 3933 
SEQ ID NO-3935. SEQ ID NO:3937, SEQ ID NO.3939, SEQ ID NO:3941 SEQ ID NO 3943* 
SEQ ID NO:3945, SEQ ID NO:3947, SEQ ID NO:3949, SEQ ID NO:3951 SEQ ID No'-3953 
SEQ ID NO:3955, SEQ ID NO:3957, SEQ ID NO:3959, SEQ ID NO:396l' SEQ ID NO : 3963 
SEQ ID NO.-3967, SEQ ID NO:3969, SEQ ID NO:3971, SEQ ID N0.3973' SEQ ID NO : 3975 
SEQ ID NO:3978, SEQ ID NO:3980, SEQ ID NO:3982, SEQ ID NO:3984 SEQ ID NO : 3986 
SEQ ID NO:3989, SEQ ID NO:3991 , SEQ ID NO:3993. SEQ ID NO.3995 SEQ ID NoW 
SEQ ID NO:3999, SEQ ID NO:4002, SEQ ID NO:4005, SEQ ID NO:4007 SEQ ID NO : 4009 
SEQ ID NO:401 1, SEQ ID NO:4013. SEQ ID NO:4017, SEQ ID NO:4019 SEQ ID NO402l' 
SEQ ID NO:4023, SEQ ID NO:4025, SEQ ID NO:4027, SEQ ID NO:4029' SEQ ID NO-403 1 
SEQ ID NO:4033, SEQ ID NO:4035, SEQ ID NO:4037, SEQ ID NO:4039, SEQ ID NO : 4041 
SEQ ID NO:4043, SEQ ID NO:4045, SEQ ID NO:4047, SEQ ID NO:4049, SEQ ID NO : 4051 ' 
SEQ ID NO:4053, SEQ ID NO:4055, SEQ ID NO:4057, SEQ ID NO:4059, SEQ ID NO : 406l' 
SEQ ID NO:4063, SEQ ID NO:4065. SEQ ID NO:4067, SEQ ID NO:4069 SEQ ID NO : 407l' 
SEQ ID NO:4073, SEQ ID NO:4075, SEQ ID NO:4077 

SEQ ID NO:4079, SEQ ID NO:4081, SEQ ID NO:4083, SEQ ID NO:4085 SEQ ID NO 4087 
SEQ ID NO.-4089, SEQ ID NO:4091, SEQ ID NO:4093, SEQ ID NO:4095 SEQ ID NO 4097* 
SEQ ID NO:4100, SEQ ID NO:4102, SEQ ID NO:4104, SEQ ID NO:4106* SEQ ID NO-4108* 
SEQ ID NO:41 10. SEQ ID NO:41 12, SEQ ID NO.4114, SEQ ID N0:41 16 SEQ ID NO 41 18 
SEQ ID NO4120, SEQ ID NO:4122, SEQ ID NO:4124, SEQ ID NO:4126' SEQ ID NO : 4128 
SEQ ID NO:4130, SEQ ED NO:4133, SEQ ID NO:4135, SEQ ID NO:413s! SEQ ID N0 414l' 
SEQ ID N04143, SEQ ID NO:4145, SEQ ID NO:4147, SEQ ID NO:4149 SEQ ID NO : 415l' 
SEQ ID NO:4153, SEQ ID NO:41 56, SEQ ID NO:4158, SEQ ID NO:4160.' SEQ ID NO : 4162' 
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SEQ ID NO:4164 t SEQ ID NO:4166 

SEQ ID NO:4169, SEQ ID NO:417l, SEQ ID NO:4173, SEQ ID NO:4175. SEQ ID NO:4177, 
5 and SEQ ID NO:4179 

(c) a polynucleotide comprising a nucleotide sequence encoding a protein comprising an amino acid sequence 
selected from the amino acid sequences of (b), in which one or more amino acids are substituted, deleted, 
inserted, and/or added, wherein said protein is functionally equivalent to the protein comprising said amino 

10 acid sequence selected from the amino acid sequences of (b); 

(d) a polynucleotide that hybridizes with a polynucleotide comprising a nucleotide sequence selected from the 
nucleotide sequences of (a), and that comprises a nucleotide sequence encoding a protein functionally equiv- 
alent to the protein encoded by the nucleotide sequence selected from the nucleotide sequences of (a); 

(e) a polynucleotide comprising a nucleotide sequence encoding a partial amino acid sequence of a protein 
15 encoded by the polynucleotide of (a) to (d); 

(f) a polynucleotide comprising a nucleotide sequence with at least 70% identity to the nucleotide sequence 
of (a). 

9. A substantially pure protein encoded by the polynucleotide of claim 8. 

20 

10. An antibody against the protein or peptide of any one of claims 6, 7, and 9. 

1 1 . A vector comprising the polynucleotide of claim 5 or 8. 

25 12. A transformant carrying the polynucleotide of claim 5 or 8, or the vector of claim 11 . 

13. A transformant expressively carrying the polynucleotide of claim 5 or 8, or the vector of claim 11 . 

1 4. A method for producing the protein or peptide of any one of claims 6 : 7, and 9, comprising culturing the transformant 
30 of claim 13 and recovering the expression product. 

15. An oligonucleotide comprising the nucleotide sequence of claim 8 (a) or the nucleotide sequence complementary 
to the complementary strand thereof, wherein said oligonucleotide comprises 15 nucleotides or more. 

35 16. Use of the oligonucleotide of claim 15 as a primer for synthesizing a polynucleotide. 

17. Use of the oligonucleotide of claim 15 as a probe for detecting a gene. 

18. An antisense polynucleotide against the polynucleotide of claim 8, or the portion thereof. 

40 

19. A method for synthesizing a polynucleotide, the method comprising: 

a) synthesizing a complementary strand using a cDNA library as a template, and using the primer set of claim 
2 or 3, or the primer of claim 1 6; and 
4 5 b) recovering the synthesized product. 

20. The method of claim 19, wherein the cDNA library is obtainable by oligo-capping method. 

21. The method of claim 19, wherein the complementary strand is obtainable by PCR. 

50 

22. A method for detecting the polynucleotide of claim 8, the method comprising: 

a) incubating a target polynucleotide with the oligonucleotide of claim 15 under the conditions where hybridi- 
zation occurs, and 

55 b) detecting the hybridization of the target polynucleotide with the oligonucleotide of claim 15. 

23. A database of polynucleotides and/or proteins, the database comprising information on at least one sequence 
selected from the nucleotide sequences of claim 8 (a) and/or the amino acid sequences of claim 8 (b), or a medium 
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on which the database is stored. 

5 

10 

15 
20 
25 
30 
35 
40 
45 
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55 
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Figure 1 
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Figure 2 
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Figure 3 
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